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RECOMBINANT ADENOVIRUS AND ADENO-ASSOCIATED VIRUS, 
CELL LINES, AND METHODS OF PRODUCTION AND USE THEREOF 

This invention was supported by the National 
Institute of Health Grant Nos. HD32649-01, DK47757 and 
5 DK49136. The United States government has rights in this 
invention. 

Field of the Invention 

The present invention relates generally to the field 
of somatic gene therapy, and specifically to methods and 
10 compositions useful in the treatment of genetic 
disorders. 

Background of the Invention 

Adenoviruses are eukaryotic DNA viruses that can be 
modified to efficiently deliver a therapeutic or reporter 

15 transgene to a variety of cell types [see, e.g., M. S. 
Horwitz et al, "Adenoviridae and Their Replication" , 
Virology, second edition, pp. 1712, ed. B. N. Fields et 
al, Raven Press Ltd., New York (1990)]. Recombinant 
adenoviruses (rAds) are capable of providing extremely 

20 high levels of transgene delivery to virtually all cell 
types, regardless of the mitotic state. The efficacy of 
this system in delivering a therapeutic transgene in vivo 
that complements a genetic imbalance has been 
demonstrated in animal models of various disorders [K. F. 

25 Kozarsky et al, Somatic Cell Mol. Genet. . 19:449-458 
(1993) ("Kozarsky et al I") ; K. F. Kozarsky et al, 
Biol. Chem. , 2^9:13695-13702 (1994) ("Kozarsky et al II) 
and others]. The use of recombinant adenoviruses in the 
transduction of genes into hepatocytes in vivo has 

30 previously been demonstrated in rodents and rabbits [see, 
e.g., Kozarsky II, cited above, and S. Ishibashi et al, 
J. Clin. Invest. . 92:883-893 (1993)]. 

The first-generation recombinant, replication- 
deficient adenoviruses which have been developed for gene 

35 therapy contain deletions of the entire Ela and part of 
the Elb regions. This replication-defective virus is 
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grown on an adenovirus-transf ormed, complementation human 
embryonic kidney cell line containing a functional 
adenovirus Ela gene which provides a transacting Ela 
protein, the 293 cell [ATCC CRL1573]. El-deleted viruses 
5 are capable of replicating and producing infectious virus 
in the 293 cells, which provide Ela and Elb region gene 
products in trans. The resulting virus is capable of 
infecting many cell types and can express the introduced 
gene (providing it carries its own promoter) , but cannot 
10 replicate in a cell that does not carry the El region DNA 
unless the cell is infected at a very high multiplicity 
of infection. 

Adeno-associated virus (AAV) is an integrating human 
DNA parvovirus which has been proposed for use as a gene 

15 delivery vehicle for somatic gene therapy [B. J. Carter, 
in "Handbook of Parvoviruses", ed. , P. Tijsser, CRC 
Press, pp. 155-168 (1990)]. This small non-enveloped 
virus contains a 4.6 kb single stranded (ss) DNA genome 
that encodes sets of regulatory and capsid genes called 

20 rep and cap. Rep polypeptides (rep78, rep68, rep62 and 
rep40) are involved in replication, rescue and 
integration of the AAV genome. The cap proteins (VP1, 
VP2 and VP3) form the virion capsid. Flanking the rep 
and cap open reading frames at the 5» and 3« ends are 145 

25 bp inverted terminal repeats (ITRs) , the first 125 bp of 
which are capable of forming Y- or T-shaped duplex 
structures . 

Recombinant forms of AAV (rAAV) have been developed 
as vectors by replacing all viral open reading frames 

30 with a therapeutic minigene, while retaining the 

necessary cis elements contained in the ITRs. [See, 
e.g., US Patent Nos. 4,797,368; 5,153,414; 5,139,941; 
5,252,479; and 5,354,678; and International Publication 
Nos. WO 91/18088 published November 28, 1991; WO 93/24641 

35 published D cember 9, 1993 and W094/13788 published June 
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23, 1994], However, progress towards establishing AAV as 
a transducing vehicle for gene therapy has been slow for 
a variety of reasons* For example, the integrated 
provirus preferentially targets specific sites in 
5 chromosome 19. Additionally, difficulties surround large- 
scale production of replication defective recombinants. 
The cells employed to produce rAAV must also be infected 
with adenovirus or herpesvirus to provide the necessary 
helper functions, thereby producing problems in purifying 

10 recombinant AAV (rAAV) from contaminating virus in 

culture. Practical experience with purified recombinant 
AAV as a gene therapy vector has been disappointing, 
because the more purified the AAV is from co-infection 
with its helper virus in culture, the lower the gene 

15 transduction efficiencies that the rAAV displays. 

There ' remains a need in the art for additional 
recombinant adenoviruses and rAAV, therapeutic 
compositions and methods which enable effective use of 
these recombinant viruses in the treatment of disorders 

20 and diseases by gene therapy. 
Summary of the In vention 

In one aspect of this invention, a packaging cell 
line is provided which expresses adenovirus genes Ela, 
Elb and E4, or functional fragments thereof, e.g., the E4 

25 open reading frame (ORF) 6. 

In another aspect, the invention provides a rAd 
comprising the DNA of at least a portion of the genome of 
an adenovirus having functional deletions of the El and 
E4 gene regions; a suitable gene operatively linked to 

30 regulatory sequences directing its expression, and an 
adenovirus capsid, the rAd capable of infecting a 
mammalian cell and expressing the gene product in the 
cell in vivo or in vitro. The invention also provides a 
mammalian cell infected with the rAd described abov . 
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In still another asp ct, the invention provides a 
rAd shuttle vector comprising the DNA of at least a 
portion of the genome of an adenovirus having functional 
deletions of the El and E4 gene regions. 
5 In a further aspect, the invention provides a method 

for producing the above-described recombinant Ad and a 
method for delivering a selected gene into a mammalian 
cell using the recombinant Ad described above. 

In another aspect, the invention provides a method 

10 for enhancing the efficiency of transduction of a 

recombinant AAV into a target cell. The method operates, 
in brief, by infecting a target cell with a ss 
recombinant adeno-associated virus (rAAV) which comprises 
a transgene operatively linked to regulatory sequences 

15 directing its expression, and contacting the infected 

cells with an agent which facilitates the conversion of 
ss rAAV to its double stranded (ds) form. Conversion of 
ss rAAV to ds rAAV occurs in the target cell, resulting 
in enhanced transduction of the rAAV into the target 

20 cell. The agent may be a helper virus which carries a 
selected gene or functional fragment thereof encoding a 
polypeptide capable of enhancing the conversion of the ss 
rAAV to ds rAAV and which is co- infected into the same 
target cell. The agent may also be a drug or chemical 

25 composition which accomplishes the same function and is 
applied to the infected target cell. This method can 
operate both in an ex vivo setting and in vivo. 

In yet another aspect, the invention provides a 
novel recombinant AAV, which contains both the transgene 

30 intended for use in treating a genetic disease or 

disorder and at least one additional gene operatively 
linked to inducible or constitutive regulatory sequences. 
The additional gene(s) encodes a polypeptide capable of 
facilitating, alone or in concert with other additional 

35 genes, the conversion of ss rAAV to its ds form upon 
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expression. In a preferred embodiment, the additional 
gene is adenovirus E4 or a functional fragment thereof. 
Also disclosed is a method for enhancing the efficiency 
of transduction of the novel rAAV into a target cell. 
5 The novel rAAV and methods of this invention are 

also useful in pharmaceutical compositions for use in ex 
vivo and in vivo gene therapy treatment protocols for 
treating inherited diseases, cancer, and other genetic 
dysfunctions. 

10 Other aspects and advantages of the present 

invention are described further in the following detailed 
description of the preferred embodiments thereof. 
Brief Descriptio n of the Drawings 

Fig. 1 is a schematic drawing of an exemplary 

15 plasmid pMMTVE40RF6 [SEQ ID NO: 1] or pMTE40RF6, which 
contains an MMTV or sheep MT promoter, respectively, in 
control of a human E4 ORF 6 gene sequence, a growth 
hormone gene terminator sequence (GH) , an SV40 ori, 
pBR322-based plasmid sequences including a neo R gene, an 

20 SV40 polyA site and an amp R gene. 

Fig. 2 is a schematic map of rAd H5. OOlCBLac^ [SEQ 
ID NO: 3] with indicated restriction endonuclease enzyme - 
sites. The striated bar represents the CBLacZ minigene; 
the black bar represents Ad5 viral backbone, the 

25 crosshatched bar represents Ad E4 deletion*. 

Fig. 3 plots LacZ forming units (LFU) /ml vs time 
(hours) for E4 complementing cell lines infected with 
HS.OOlCBiacZ. 

Fig. 4A is a graph of the induction, ORF6 expression 
30 and viral production in 293-27-18 packaging cells 

plotting yield at 24 hours post-infection (pi) in LFU/ml 
and ORF 6 protein (abs.mm) vs. concentration of the 
inducer, dexamethasone (mM) . Abs.mm is the intensity of 
the size of the protein band on a Western blot and 
35 reflects absorbance and protein size in mm 2 . The square 
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is yield at 24 hours pi. The diamond is ORF6 protein 
detected at 24 hours pi. 

Fig. 4B is a similar graph to that of Fig. 4A, 
except that the packaging cells are 293-10-3 cells. The 
5 symbols are as described for Fig. 4A. 

Fig. 5A is a bar graph plotting B-galactosidase 
enzyme activity in lysates from infected Hela cells. The 
horizontal axis indicates the adenoviruses infected into 
the HeLa cells, with the symbol "+ M indicating the 
10 addition of the adenovirus to the rAAV f AV.CMVLacZ. The 
vertical axis indicates intracellular B-galactosidase 
specific activity (mUnits/mg protein) using ONPG. Below 
each bar, the fold- induction in specific activity 
relative to cells that received the AV.CMVLacZ vector 
15 alone is given. 

Fig. 5B is a bar graph plotting Ad multiplicity of 
infection (MOI) in HeLa cells of wild-type Ad5 or the E2 
mutant dl802 f the cells co-infected with rAAV vs. 
intracellular B-galactosidase specific activity. See 
20 Example 11. 

Fig. 6A is a graph in which B-galactosidase specific 
activity and counts per minute (CPM) are plotted along 
the vertical axis and adenovirus MOI f s are on the 
horizontal axis for HeLa cells infected with wtAdS and 
25 rAAV according to Example 12. Data obtained from low MOI 
(1, 5, and 10) infections are shown. 

Fig. 6B is a graph similar to that of Fig. 6A except 
that the cells were infected with Ad mutant dl802. 

Fig. 7A illustrates a model for leading strand 
30 synthesis of a complementary AAV strand in the presence 
of Rep (+Rep) or absence of Rep (-Rep) . Rep expresses a 
terminal resolution activity that can convert a duplex 
structure with closed-ends to an open-ended duplex. In 
the absence of R p, terminal resolution is impaired 
35 leaving the covalently closed, hairpin structures intact. 
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Under these conditions , hairpins are expected to be found 
leftward and rightward, since both strands of a rescued 
ds AAV genome are packaged into virions. 

Fig* 7B is a schematic of linear AV.CMVLacZ with 
5 labeled domains including the AAV ITRs, CMV immediate 
early enhancer/promoter (CMV) , SV40 splice donor-splice 
acceptor (SD/SA) , E. coli B-galactosidase cDNA (LacZ) , 
and SV40 polyA signal (pA) . Two NotI sites located at bp 
positions 1035 and 4509 are indicated. 
0 Fig. 7C illustrates a closed end and an open end 

fragment of rAV.CMVLac^. 

Figs. 7D, 7E and 7F indicate the mixture of 
open-ended and covalently closed duplex fragments 
generated by NotI digestion of ss AV.CMVLacZ at position . 
5 4509 in the absence of terminal resolution. The NotI 

4509 digestion provides a convenient means of releasing a 
361 bp fragment that contains the right ITR in the 
context of a hybridization target (i.e. SV40 pA) . In the: 
presence of terminal resolution, only the open-ended 361 - 
0 bp fragment would be expected to be generated (Fig. 7D) 
by such digestion. 

Fig. 8A is a bar graph plotting B-galactosidase 
specific activity (mUnits/mg protein) vs. increasing 
concentration of zinc (jtM) inducer for cell line 293 (MT- 
5 ORF6) transduced with AVCKVLacZ (first row below each 
bar) . Also provided is the fold- induction relative to 
293 cells (second row below each bar) , and the 
fold- induction relative to 293(ORF6) cells maintained in 
the absence of zinc (third row) . 
) Fig. 8B is a bar graph plotting CPM of duplex 

monomer replicative form (RFm) of rAAV vs. the 
concentration of zinc (/xM) used for induction and the 
fold-induction relative to 293(ORF6) cells maintained in 
0 mM zinc below each bar. 
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Fig. 8C is a graphical comparison of the induction 
profiles that describe AV.CMVLacZ transduction 
efficiency. Specific activity data from Fig. 8A and CPM 
data of AV.CMViacZ RFm from Fig. 8B are plotted along the 
5 vertical axis, and concentration of zinc sulfate used 

during the experiment is shown along the horizontal axis. 

Fig. 9 is a bar graph plotting specific activity 
(milliunits B-galactosidase/mg protein) vs the 
concentration of zinc used for induction (first row under 
10 the horizontal axis) , the fold-induction relative to HeLa 
cells (second row) , and the fold- induction relative to 
HeLa (Mt-ORF6 ) cells maintained in the absence of zinc 
(third row), for the HeLa (MT-ORF6 ) cells transduced at an 
MOI of 1,000 AV.CMVLacZ virus particles/cell in the 
15 absence of zinc sulfate inducer or in the presence of 50, 
100, 150, 200 or 250 MM zinc sulfate inducer . 

Fig. 10 is a schematic of the plasmid pAV.CMVLacZ 

[SEQ ID NO: 4] . 

Fig. 11 illustrates plasmid pAV.CMVALP.GRE-0RF6 [SEQ 

20 ID NO: 5]. 

Detailed Description of the Invention 

The present invention provides packaging cell lines, 
which enable the production of recombinant adenoviruses 
(rAd) functionally deleted in both the El and E4 genes. 

25 These rAd and methods which enable the therapeutic 

treatment of disorders with such rAds are disclosed. 
Novel "second generation" recombinant adeno-associated 
virus (rAAV) and methods for enhancing the transduction 
efficiency of rAAV containing a transgene for expression 

30 in a somatic gene therapy protocol are also provided. 

The methods and compositions of this invention are useful 
in bx vivo applications of gene therapy, such as in the 
transduction of bone marrow cells with desirable 
hematopoi tic stem cell progenitor genes prior to bone 

35 marrow transplantation. The embodiments of the invention 
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are also useful in pharmaceutical compositions for direct 
in vivo treatment of patients by gene therapy vectors, 
including the transduction of desirable genes in patients 
with genetic disorders, such as cystic fibrosis. 
5 J. Packaging Cell Lines 

To increase the transgene capacity and decrease 
immune response of rAds, as many viral genes as possible 
should be deleted to inactivate the adenovirus. However, 
it is crucial to generate complementing cell lines for 

10 construction and propagation of such deleted Ad. The 

method and compositions of the present invention overcome 
several problems previously identified in the gene 
therapy for first generation El deleted adenoviruses and 
display advantages in administration particularly to 

15 muscle tissue. 

Early region 4 (E4) of Ad serotype 5 consists of 7 
ORFs believed to be involved in viral DNA replication, 
host cell shut-off, and late mRNA accumulation. To 
generate rAd deleted in E4 , the function of the E4 region 
must be supplied to the rAd by a helper virus or 
packaging cell line. However, useful packaging cell 
lines have not been available previously because normally 
the continuous expression of functioning Ad El and 
functional E4 in a single cell line are toxic to the 
25 cell. Such cells are therefore not useful for the growth 
and replication of rAds. Further, the DNA encoding the 
functional Ad El and Ad E4 genes, when present in a 
packaging cell line, can increase the chances of 
recombination with a rAd virus to cause the virus to 
3 0 revert to a wildtype Ad virus. 

The present invention avoids these problems by 
providing a packaging cell line which contains the Ad5 El 
gene and only the ORF 6 of the Ad5 E4 gene. 0RF6 of E4 
alone can provide the requirements for E4 in the viral 
35 life cycle. 



20 
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According to this invention, the ORF6 is preferably 
under the transcriptional control of an inducible 
promoter. The mouse mammary tumor virus (MMTV) promoter, 
inducible by a glucocorticoid, particularly, 
5 dexamethasone, is presently preferred. The DNA sequence 
of the MMTV promoter spans nucleotides 1-1506 of SEQ ID 
NO: 1. Another inducible promoter is the sheep 
metallothionine (MT) promoter, inducible by zinc [M. G. 
Peterson et al, J. Biochem. . 121:417-424 (1988)]. 

10 However, the zinc sulfate inducer of the MT promoter can 
itself be toxic to the cells. Other inducible promoters, 
such as those identified in International patent 
application W095/13392, published May 18, 1995, and 
incorporated by reference herein may also be used in the 
15 production of packaging cell lines according to this 
invention. Constitutive promoters, such as the 
constitutive Ad5 E4 region promoter, LTR, may be employed 
in control of the expression of ORF6. 

The packaging cell line of the invention which 
20 utilizes an inducible promoter permits one to control the 
development of toxicity by regulating the expression of 
the E4 0RF6 gene. After the desired shuttle vector 
containing the Ad sequences is transfected into the cell 
line, expression of the E4 ORF6 can be induced by the 
25 appropriate inducer. The packaging cell is thus able to 
provide both Ad El and Ad E4 ORF6 gene products to the 
rAd for a sufficient period to allow productive infection 
and recovery of the rAd, before the cell becomes toxic. 
At present, the time period before the cell experiences 
30 toxicity is about 10 days. 

In its most preferred form, the packaging cell line 
is a human embryonic kidney (HEK) 293 El expressing cell 
line into which is introduced the E4 ORF 6 sequence under 
the control of the inducible promoter. It should be 
35 underst od by one of skill in the art that another parent 
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cell line may be selected for the generation of a novel 
cell line expressing the Ela, Elb, and £4 ORF6 genes of a 
selected adenovirus serotype. Among such parent cell 
lines may be included HeLa [CCL 2], A549 [CCL 185], KB 
5 [CCL 17], Detroit [e.g., Detroit 510, CCL 72] and WI-38 
[ATCC CCL 75] cells. These cell lines are all available 
from the American Type Culture Collection, 12301 Parklawn 
Drive, Rockville, MD, USA. Other suitable parent cell 
lines may be obtained from other sources. If such parent 

10 cell lines were selected for modification, the cell line 
would need to be further supplied with the Ela and Elb 
gene functions, e.g., such as by transfection with a 
plasmid containing these genes or functional fragments 
thereof under a suitable promoter, as well as with the 

15 ORF6 gene as described herein. 

Example 1 teaches construction of packaging cell 
lines containing only the ORF 6 of Ad5 E4 region or, for 
functional comparisons, the entire E4 region. Briefly 
described, the entire E4 region and an ORF6 sequence of 

20 Ad 5 E4 gene were obtained by known techniques [see, 

e.g., Sambrook et al., "Molecular Cloning. A Laboratory 
Manual.", 2d edit., Cold Spring Harbor Laboratory , New 
York (1989) and references cited therein]. To isolate 
the ORF6 region, the anchored PCR technique was used to 

25 amplify the ORF6 sequence from its initiation codon to 
its termination codon. Primers selected from the 
published sequence of ORF 6 were used to amplify the ORF 
sequence and insert restriction sites onto the end of the 
sequence. The E4 ORF 6 sequence itself is reproduced as 

30 nucleotides 1523 through 2408 of SEQ ID NO: 1. The 
entire E4 gene sequence is published in the Genbank 
sequence of Ad5 [Genbank Accession No. M73260] . 

A minigene was constructed that placed the ORF6 
sequence under the control of a selected promoter. By 

35 "minigene" as used here is meant the combination of the 
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desired s guence to be expr ss d (in this particular 
instance, the ORF6 sequence) and the other regulatory 
elements necessary to transcribe the desired sequence and 
express the gene product in a cell containing that 
5 minigene. The ORF6 sequence gene is operatively linked 
to regulatory components in a manner which permits its 
transcription. Such components include conventional 
regulatory elements, such as a promoter to drive ORF6 
expression. One inducible promoter was the Zn +2 

10 inducible MT promoter; the other was the dexamethasone- 
inducible MMTV promoter of SEQ ID NO: 1. 

The minigene also contains nucleic acid sequences • 
heterologous to the ORF6 viral sequence, including 
sequences providing signals required for efficient 

15 polyadenylation of the transcript (poly-A or pA) . A 
common poly-A sequence which is employed in this 
invention is that derived from the growth hormone (GH) 
gene terminator sequence (nuc. 2409-3654 of SEQ ID NO: 
1) . The poly-A sequence generally is inserted in the 

20 minigene following the 0RF6 sequence. The polyA sequence 
employed in the MMTV-ORF6 minigene described in Example 1 
[SEQ ID NO: 1] is supplied by the GH gene terminator and 
an SV40 origin of replication (ori) . A similar minigene 
differing in promoter sequence, polyA sequence and/or 

25 SV40 ori can also be designed by one of skill in the art 
to transfer the E4 ORF6 sequence to a shuttle plasmid. 
Selection of these and other common vector elements are 
conventional [see, e.g., Sambrook et al, cited above, and 
references cited therein] and many such sequences are 

30 available from commercial and industrial sources as well 
as from Genbank. 

The ORF6-containing minigene was subcloned into a 
pBR322-based shuttle plasmid that contained a neomycin 
resistance gene, r suiting in the shuttle vector of Fig. 

35 1. Any of the many known bacterial shuttle vectors may 
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be employed to carry the minigene, providing that the 
vector contains a reporter gene or selectable marker of 
which many, e.g., neo, amp or purimycin, are known in the 
art. It is expected that one of skill in the art can 
5 develop other suitable shuttle vectors using other 
plasmid components which are similarly capable of 
transferring the ORF6 minigene into the chromosome of a 
cell transfected with the plasmid. 

As further described in Example 1, other shuttle 
10 vectors were designed for comparative purposes, which 
contain the complete or substantially complete Ad5 E4 
region under the control of the constitutive retroviral 
MLV LTR sequence in the presence or absence of the 
endogenous E4 promoter. The shuttle plasmid carrying the 
15 ORF6 minigene (or the entire E4 region) was introduced 

into HEK 293 cells which express the Ad El gene products. 
Complementing cell lines were generated that express 
these Ad E4 or ORF6 genes from either their endogenous 
promoters or heterologous inducible promoters. These 
cell lines are further characterized by their genetic 
constitution, E4 protein synthesis, recombinant AAV 
helper function, relative plaque efficiency of H5dll004 - 
virus, and growth kinetics of recombinant E1/E4 deleted 
adenovirus. These characteristics of exemplary E1/E4 
25 expressing packaging cell lines are discussed in detail 
in the following examples. 

II. Recombinant Adenovirus 

The E1/E4 expressing cell line is useful in 
constructing E1/E4 deleted rAds which can deliver a 

30 suitable gene to mammalian cells and tissues. These rAd 
are functionally deleted in at least the Ela, Elb and E4 
Ad gene regions. By the term "functionally deleted" is 
meant that a sufficient amount of the gene region is 
rem v d or otherwise damaged, .g., by mutation or 

35 modification, so that the gene r gion is no longer 
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capable of producing the products of g ne xpression. If 
desired, the entire gene region may be removed. In in 
vivo experiments with the rAd grown in the packaging cell 
lines, the E1/E4 deleted rAd demonstrated utility 
5 particularly in transferring a transgene to a muscle 
cell. 

The adenovirus sequences used in the construction of 
the shuttle vectors, helper viruses, if needed, and rAd, 
and other components and sequences employed in the 

10 construction of the vectors and viruses described herein 
may be readily obtained from commercial or academic 
sources based on previously published and described 
sequences. Viral materials may also be obtained from an 
individual patient. The viral sequences and vector 

15 components may be generated by resort to the teachings 
and references contained herein, coupled with standard 
recombinant molecular cloning techniques known and 
practiced by those skilled in the art. Modifications of 
existing nucleic acid sequences forming the vectors, 

20 including sequence deletions, insertions, and other 

mutations taught by this specification may be generated 
using standard techniques. Similarly, the methods 
employed for the selection of viral sequences useful in a 
vector, the cloning and construction of the M minigene M 

25 and its insertion into a desired viral shuttle vector and 
the production of a recombinant infectious virus are 
within the skill in the art given the teachings provided 
herein. 

A. Construction of the TransgenB 
30 A "minigene" in this context is defined as 

above, except that the components of this minigene are 
designed to express the gene product ex vivo or in vivo* 
Such components include conventional regulatory elements 
nec ssary to drive expression of the transgen in a cell 
35 transfected with the rAd. For this minig ne, a selected 
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promoter is operatively linked to the transgene and 
located, with other regulatory elements, within the 
selected viral sequences of the recombinant vector. 
Selection of the promoter is a routine matter and is not 
a limitation of this invention. Useful promoters may be 
constitutive promoters or regulated (inducible) 
promoters, which will enable control of the amount of the 
transgene to be expressed. For example, a desirable 
promoter is that of the cytomegalovirus (CUV) immediate 
early promoter/ enhancer [see, e.g., Boshart et al, ■ Call . 
11:521-530 (1985)]. Another desirable promoter includes 
the Rous sarcoma virus LTR promoter/ enhancer . Still 
another promoter/ enhancer sequence is the chicken 
cytoplasmic 6-actin (CB) promoter [T. A. Kost et al, 
Nucl. Acids Res. , 11(23) :8287 (1983)]. Other suitable 
promoters may be selected by one of skill in the art. 

The minigene may also desirably contain nucleic 
acid sequences heterologous to the viral vector sequences 
including poly-A sequences and introns with functional 
splice donor and acceptor sites, as described above. The 1 
poly-A sequence generally is inserted in the minigene 
following the transgene sequences and before the 3» - 
adenovirus sequences. A minigene of the present 
invention may also contain an intron, desirably located 
between the promoter /enhancer sequence and the transgene. 
Selection of these and other common vector elements are 
conventional as described above and many such sequences 
are available from commercial and industrial sources as 
well as from Genbank. 

As above stated, the minigene is located in the 
site of any selected deletion in the rAd. In the E1/E4 
deleted rAd H5. OOiCBiacZ, the transgene is located in the 
deleted El gene region. However, the transgene may be 
located elsewhere in the adenovirus sequence, as desired. 
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B. Production of Recombinant Adenovirus 
Adenovirus sequences useful in this invention 
may include the DNA sequences of a number of adenovirus 
types, which are available from Genbank, including type 
5 Ad5 [Genbank Accession No. M73260] . The adenovirus 
sequences may be obtained from any known adenovirus 
serotype, such as serotypes 2, 3, 4, 7, 12 and 40, and 
further including any of the presently identified 41 
human types [see, e.g., Horwitz, cited above]. Similarly 
10 adenoviruses known to infect other animals may also be 

employed in the vector constructs of this invention. The 
selection of the adenovirus type is not anticipated to 
limit the following invention. A variety of adenovirus 
strains are available from the American Type Culture 
15 Collection, Rockville, Maryland, or available by request 
from a variety of commercial and institutional sources. 
In the following exemplary embodiment an adenovirus, type 
5 (Ad5) is used for convenience. 

However, it is desirable to obtain a variety of 
20 adenovirus shuttle vectors based on different human 

adenovirus serotypes. It is anticipated that a library 
of such plasmids and the resulting rAds would be useful 
in a therapeutic regimen to evade cellular, and possibly 
humoral, immunity, and lengthen the duration of transgene 
25 expression, as well as improve the success of repeat 

therapeutic treatments. Additionally the use of various 
serotypes is believed to produce rAd with different 
tissue targeting specificities. Additionally, the 
absence of adenoviral genes El and E4 in the rAd of this 
3 0 invention should reduce or eliminate adverse CTL 

responses which normally cause destruction of rAds 
deleted of only the El gene. 

rAds of this invention are recombinant, 
defective adenoviruses (i.e., El deleted) which are also 
35 deleted completely or functionally of the E4 gene region. 
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Functional deletions of £4 gene regions may be assessed 
by assays of Examples 2 and 3, among other assays. rAds 
useful in this invention may optionally bear other 
mutations, e.g., temperature sensitive (ts) mutations in 
5 the E2a gene region, and deletions in the E3 gene 
regions . 

An adenovirus of this invention contains a 
functional deletion of the adenoviral early immediate 
early gene Ela (which spans mu 1.3 to 4.5) and delayed 

10 early gene Elb (which spans mu 4.6 to 11.2) . Similarly 
the adenovirus has a functional deletion of the whole E4 
region (which spans mu 92 to 97.2), or of at least 0RF6 
of the E4 region. Gene regions which may be optionally 
deleted in the E1/E4 deleted rAd of this invention 

15 include all or a portion of the adenovirus delayed early 
gene E3 (which spans mu 76.6 to 86.2). The function of 
E3 is irrelevant to the function and production of the 
rAd. 

The rAd of this invention may also have a 

20 mutation which results in reduced expression of 

adenoviral protein and/or reduced viral replication. For 
example, a ts mutation may be introduced into the 
adenovirus delayed early gene E2a (which spans mu 67.9 to 
61.5). Among such mutations include the incorporation of 

25 the missense ts mutation in the (DBP) E2a region found in 
the Ad5 H5tsl25 strain [P. Vander Vliet et al, J. Virol. . 
15:348-354 (1975)] at 62.5 mu. A single amino acid 
substitution (62.5 mu) at the carboxy end of the 72 kd 
protein produced from the E2a gene in this strain 

30 produces a protein product which is a ss DNA binding 

protein and is involved in the replication of adenoviral 
genomic DNA. At permissive temperatures (approximately 
32 °C) the ts strain is capable of full life cycle growth 
on HeLa cells, while at non-permissive temperatur s 

35 (approximately 38 °C) no replication of adenoviral DNA is 
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seen. In addition, at non-permissive temperatur s, 
decreased immunoreactive 72 kd protein is seen in HeLa 
cells. See, e.g., J. F. Engelhardt et al, Hum. Gene 
Ther. . 1:1217-1229 (1994); J. F. Engelhardt et al. Prop, 
5 Natl. Acad. Sci. > USA . 91:6196-6200 (1994) and 

International patent application W095/13392, published 
May 18, 1995, incorporated by reference herein. 

However, it should be understood that other 
deletions in the adenovirus genome as previously 

10 described in the art or otherwise may also occur in the 
rAds of this invention. One minimal type of rAd can 
contain adenovirus genomic sequences from which all viral 
genes are deleted. More specifically, the adenovirus 
sequences may be only the cis-acting 5 * and 3 • inverted 

15 terminal repeat (ITR) sequences of an adenovirus (which 
function as oris) and the native 5 1 packaging/ enhancer 
domain, that contains sequences necessary for packaging 
linear Ad genomes and enhancer elements for the El 
promoter. The adenovirus 5* sequence containing the 5' 

20 ITR and packaging/ enhancer region (Ad5 mu 0-1 or bp 1- 

360) can be employed as the 5 1 adenovirus sequence in rAd 
of this invention. The 3 1 adenovirus sequences including 
the right terminal (3 1 ) ITR sequence of the adenoviral 
genome spanning about bp 35,353 - end of the adenovirus 

25 genome, or map units "98.4-100 may be desirably employed 
as the 3 v sequence of the rAd . These sequences , which 
are clearly devoid of the El and E4 genes, can flank, or 
be operatively associated with the minigene in a rAd. 
Any other necessary Ad gene products will then be 

30 supplied by helper viruses and the E1/E4 ORF6 expressing 
packaging cell of this invention. 

Exemplary rAd for use in this invention, for 
example, may be obtained by homologous recombination of 
desired fragments from various rAd, a technique which has 

35 been commonly employed t generate other rAd for gene 
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therapy use* In the examples below, a representativ 
rAd, H5. 001CB£acZ, is constructed by homologous 
recombination between the adenovirus d!1004 (also 
H5dll004) viral backbone and pAdCBI>acZ minigene DNA. 
5 H5dll004 is an Ad5 virus deleted of from about map unit 
92.1 through map unit 98, i.e, substantially the entire 
£4 gene. The dl!004 virus is described in Bridge and 
Ketner, J. Virol. . 632 (2) : 631-638 (Feb. 1989). 

The pAdCBLacZ vector is a cDNA plasmid 

10 containing Ad m.u. 0-1, an El deletion into which is 
inserted a bacterial fi-galactosidase gene under the 
control of a chicken B-actin promoter, with other 
regulatory elements as described below, and flanked by Ad 
m.u. 9-16 and plasmid sequence. 

15 The production of the E1/E4 rAd of this 

invention in the packaging cell line of this invention 
utilizes conventional techniques. Such techniques 
include conventional cloning techniques of cDNA such as 
those described in texts [Sambrook et al, cited above], 

20 use of overlapping oligonucleotide sequences of the 

adenovirus genomes, PGR and any suitable method which 
provides the desired nucleotide sequence. Standard 
transfection and co-transf ection techniques are employed, 
e.g. , CaP0 4 transfection techniques using the 

25 complementation 293 cell line. Other conventional 

methods employed include homologous recombination of the 
viral genomes, plaquing of viruses in agar overlay, 
methods of measuring signal generation, and the like. 

For example, following the construction and 

30 assembly of the desired minigene-containing plasmid 

vector pAdCBLacZ, the E1/E4 expressing packaging cell 
line of this invention is infected with the helper virus 
H5dll004. The infected cell line is then subsequently 
transf ected with an adenovirus plasmid vector by 

35 conventional methods. Homologous recombination occurs 
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between the E4-d ltd H5dll004 helper and the pAdCBLacZ 
vector, which permits the adenovirus-transgene sequences 
in the vector to be replicated and packaged into virion 
caps ids, resulting in the rAd. About 30 or more hours 
5 post-transfection, the cells are harvested, an extract 
prepared and the rAd containing the LacZ transgene is 
purified by buoyant density ultracentrif ugation in a CsCl 
gradient . 

XJJ. Use of the Recombinant Virus in Gene Therapy 

10 The rAd containing the transgene produced by 

cooperation of the adenovirus vector and £4 deleted 
helper virus and packaging cell line, as described above, 
provides an efficient gene transfer vehicle which can 
deliver the transgene in a pharmaceutical composition to 

15 a patient in vivo or ex vivo and provide for integration 
of the gene into a mammalian cell. 

The rAd are administered to humans in a conventional 
manner for gene therapy and serve as an alternative or 
supplemental gene therapy for the disorder to which the 

20 transgene is directed. A rAd of this invention may be 
administered to a patient, preferably suspended in a 
biologically compatible solution or pharmaceutically 
acceptable delivery vehicle. A suitable vehicle includes 
sterile saline. Other aqueous and non-aqueous isotonic 

25 sterile injection solutions and aqueous and non-aqueous 
sterile suspensions known to be pharmaceutically 
acceptable carriers and well known to those of skill in 
the art may be employed for this purpose. 

The rAd are administered in sufficient amounts to 

30 transfect the desired target cells, e.g., muscle, liver, 
epithelial, etc. and provide sufficient levels of 
transfer and expression of the transgene to provide a 
therapeutic benefit without undue adverse or with 
medically acceptable physiological effects which can be 

35 determined by those skilled in the medical arts. 
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Conventional and pharmaceutically acceptable rout s of 
administration include direct delivery to the muscle or 
other selected cell, intranasal, intravenous, 
intramuscular, subcutaneous, intradermal, oral and other 
5 parental routes of administration. Routes of 
administration may be combined, if desired. 

Dosages of rAd will depend primarily on factors such 
as the condition being treated, the age, weight and 
health of the patient, and may thus vary among patients. 

10 For example, a therapeutically effective human dose of 
the rAd is generally in the range of from about 20 to 
about 100 ml of saline solution containing concentrations 
of from about IxlO 9 to lxlO 11 pfu/ml virus. A preferred 
human dose is estimated to be about 50 ml saline solution 

15 at 2xl0 10 pfu/ml. The dose will be adjusted to balance 
the therapeutic benefit against any side effects. The 
levels of expression of the transgene can be monitored to 
determine the frequency of administration. 

An optional method step involves the co- 

20 administration to the patient, either concurrently with, - 
or before or after administration of the rAd of a 
suitable amount of a short acting immune modulator. The 
selected immune modulator is defined herein as an agent 
capable of inhibiting the formation of neutralizing 

25 antibodies directed against the recombinant vector of 

this invention or capable of inhibiting or substantially 
delaying cytolytic T lymphocyte (CTL) elimination of the 
vector. Among desirable immune modulators are 
interleukin-12 [European Patent Application No. 441,900]; 

30 gamma interferon [S. C. Morris et al, J. Immunol . , 

152:1047 (1994)]; interleukin-4 [United States Patent No. 
5,017,691]; antibody to the CD4 protein, such as anti-OKT 
3+ [see, e.g., US Patent No. 4,658,019] or antibody GK1.5 
(ATCC Accession No. TIB207) ; a soluble CD40 molecule or 

35 an antibody to CD40 ligand (Bristol-Myers Squibb Co) 
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[European patent application 555,880, published August 
18, 1993]; a soluble form of B7 or an antibody to CD28 or 
CTIA4 [CTIA4-Ig (Bristol-Myers Squibb Co) , European 
patent application 606,217, published July 20, 1994], or 
5 agents such as cyclosporin A or cyclophosphamide. Thus, 
the pharmaceutical compositions and methods of this 
invention provide a desirable gene therapy treatment. 
IV. Recombinant Adeno-Associated Viirus 
In the following context the term "transgene" means 

10 a nucleic acid sequence or reverse transcript thereof, 
heterologous to the AAV sequence, which encodes a 
polypeptide or protein of interest. The transgene may be 
operatively linked to regulatory components in a manner 
which permits transgene transcription, i.e., the 

15 transgene is placed into operative association with a 

promoter, as well as other regulatory sequences, such as 
SV40 introns or polyA sequences, useful for its 
regulation. The composite association of the transgene 
with its regulatory sequences is referred to herein as a 

20 minicassette or minigene. 

The composition of the transgene or minicassette 
sequence will depend upon the use to which the resulting 
rAAV will be put. For example, one type of transgene 
sequence includes a reporter sequence, which upon 

25 expression produces a detectable signal. Such reporter 
sequences include without limitation, an E. coli 5- 
galactosidase (LacZ) cDNA, an alkaline phosphatase gene 
(ALP) and a green fluorescent protein gene. These 
sequences, when associated with regulatory elements which 

30 drive their expression, provide signals detectable by 
conventional means, e.g., ultraviolet wavelength 
absorbance, visible color change, etc. 

Another type of transgene sequence includes a 
therapeutic gene which expresses a desired gene product 

35 in a host cell. These th rapeutic nucleic acid sequences 
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typically encode products for administration and 
expression in a patient in vivo or bx vivo to replace or 
correct an inherited or non- inherited genetic defect or 
treat an epigenetic disorder or disease. Such transgenes 
5 may be readily selected by one of skill in this art and 
the design of the transgene or the minicassette for 
insertion into the rAAV is not a limitation of this 
invention. 

The term "rAAV" encompasses any recombinant AAV gene 
10 therapy vehicle of the prior art, including the AdAAV 

hybrid virus described in published International Patent 
Application No. W096/13598, published May 9, 1996. More 
specifically, rAAV defines a rAAV comprising: (a) the DNA 
of at least a portion of the genome of an AAV, which 
15 portion is capable of transducing into a target cell at 

least one selected gene in the absence of cell division; - 
and (b) at least one selected gene (or transgene) 
operatively linked to regulatory sequences directing its 
expression, the gene flanked by the DNA of (a) and 
20 capable of expression in the target cell in vivo or in 
vitro. 

Other rAAVs have been described in the art. The 
method of this invention is not limited by the precise 
nature of the AAV sequences used in the rAAV, provided 

25 that at a minimum both the 5 • and 3 1 AAV inverted 

terminal repeats are present. Thus, the rAAV may be 
selected by one of skill in the art, and is not itself a 
limitation on this invention. The rAAVs specifically 
disclosed herein are illustrative. 

30 By the term "transduction" is meant that the rAAV 

produced by practice of the invention is capable of 
infecting a desired target cell and expressing the 
transgene in the cell by harnessing the cell's machinery. 
Transduction may include stably integrating the viral DNA 

35 into a chromosome of the target cell. "Enhanced 
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transduction" is d fin d as the ability of the rAAV in 
the presence of a conversion agent to transduce the 
target cell, either in vitro, &x vivo or in vivo, at an 
efficiency greater than a typical prior art rAAV produced 
5 in, and purified from, a culture co- infected with an 
adenovirus or herpesvirus helper. 

This method is based on the observation that the 
limiting step in rAAV mediated transduction of cells for 
gene therapy is not the internalization or transfer of 

10 the ss viral genome, but rather the subsequent conversion 
of the single-stranded (ss) viral genome to a 
transcriptionally active double-stranded (ds) form. 
Formation of ds DNA intermediates is necessary for 
recombinant gene expression, which is likely to be 

15 modulated by viral and cellular factors through 

posttranscriptional mechanisms. The inventors have 
designed a method to overcome this rate-limiting step, 
thereby enhancing transduction ability of an rAAV and 
ultimately the use of rAAV in gene therapy protocols. 

20 This method of the present invention may employ a 

conventionally prepared ss rAAV containing a transgene. 
The prior art produces ss rAAV by co-infection in culture 
with a helper adenovirus or herpesvirus, followed by 
purifying the rAAV from the culture contaminants 

25 including the helper virus, and infecting the target cell 
with the rAAV alone. The present invention provides for 
infecting a target cell with a ss rAAV. However, once 
the target cell is infected, the infected cell is 
contacted with an agent which facilitates the conversion 

30 of the ss rAAV to the ds form of rAAV. The action of 
this "facilitating agent** or "conversion agent* 1 causes 
the ss to ds conversion to occur in the target cell, 
resulting in enhanced transduction of the recombinant AAV 
into the target cell. By facilitating the conversion of 

35 ss to ds rAAV in the target cell, the method of this 
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invention may also result: in both transduction and stable 
chromosomal integration of the rAAV into the chromosome 
of said host cell. 

Preferably , for use of this invention the 
5 "facilitating or conversion agent H may take several 
forms • 

A. The Conversion Agent is a Helper Virus 
In one embodiment, the agent is a helper virus 
and the method includes an additional step of co- 

10 infecting the target cell with the helper virus. The 
helper virus useful in this method contains a selected 
gene which can facilitate the conversion of ss rAAV to ds 
rAAV. The selected gene may encode a gene product or 
polypeptide (or a functional fragment of the polypeptide 

15 which shares the biological activity of the full-length 
polypeptide) which enhances the conversion. 
Alternatively, the selected gene may express an antisense 
or ribozyme which functions in the cell to block or 
inhibit a cellular gene that normally prevents ss to ds > 

20 conversion of the rAAV. These genes may also be employed 
in the second generation rAAV described below. 

The helper virus is capable of expressing the 
selected gene product in the target cell in the absence 
of cell division. The helper virus may be a wild-type or 

25 mutant adenovirus. The helper virus may alternatively be 
a wild-type or mutant herpesvirus. Preferably, for use 
as facilitating agents, such viruses are mutants deleted 
of several normal genes so that the helper viruses and /or 
their expressed gene products will not cause disease in a 

30 patient. 

For example, a helper adenovirus useful in this 
invention may express only a gene product of a single 
adenoviral early gene. -Exposure of the ss rAAV to an Ad 
early gene product is sufficient to substantially enhance 
35 the formation of ds rAAV genom with a c ordinate 
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increase in transduction efficiency* The Ad early genes 
which are useful in producing this effect are El, E2a, E4 
and functional fragments thereof. However, as 
demonstrated by the examples below, adenovirus 
5 substantially enhances recombinant AAV transduction In 
vitro in a way that is dependent on expression of the El 
and E4 genes of adenovirus and is directly proportional 
to the appearance of ds replicative forms of rAAV. 

One example of a helper virus is an adenovirus 

10 deleted of most of its wild-type early genes and which is 
capable of expressing only its E4 gene or a functional 
fragment thereof in the target cell. Among such 
functional fragments is the ORF 6 of the E4 gene. As 
described below in the examples, experiments in cell 

15 lines indicate that the ORF6 of the adenoviral E4 gene 
locus is sufficient to significantly enhance rAAV 
transduction. Selective expression of the E4-ORF6 
product of adenovirus accomplishes a increase in 
transduction efficiency similar to, but somewhat 

20 attenuated, compared to that produced by exposure to the 
El and E4 gene products in combination. That is, the 
ORF 6 product of E4 is sufficient to enhance the 
augmentation of rAAV transduction; but this effect is 
amplified substantially by El gene products. 

25 Thus, more preferably, exposure of the rAAV to 

both the expressed El and E4 gene products produces a 
substantial enhancement of the above-described rate 
limiting step. Therefore, another exemplary helper virus 
may also contain more than one gene which, upon 

30 expression, facilitates the ss to ds conversion. An 
example of such a helper virus is an adenovirus which 
expresses both the El and E4 genes, or functional 
fragments thereof. Still other Ad genes may be expressed 
by the helper virus, provided that the virus is 

35 sufficiently crippled so that it does not cause disease 
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in the patient: contributing the target cells. 

Where the agent which facilitates conversion of 
ss to ds rAAV is a helper virus, the method of the 
invention comprises co-infecting the target cell with the 
5 rAAV and the helper virus. Such co-infection may occur 
in the context of ex vivo therapy, i.e., manipulations 
performed on cells extracted from the patient, which 
cells are reinserted into the patient after the method is 
performed. Alternatively, the patient may be directly 

10 co-infected with the two viruses by conventional means. 
Delivery of the two viruses to the patient may be 
directed to a specific organ, or to the general 
circulatory system. Such delivery methods are described 
in the art for gene therapy of, e.g., cystic fibrosis 

15 [see, e.g., US Patent No. 5,240,846]. 

B • The Conversion Agent is a Chemical, Drug 
or Other Entity that can Activate rAAV Transduction 

In another embodiment of the method of this 
invention, the conversion agent which contacts the cells 

20 infected with the rAAV may be selected from the following, 
classes of known compounds or methods: 1) inhibitors of 
DNA synthesis such as hydroxyurea, hydrogen peroxide, and 
other direct or indirect inhibitors of DNA polymerase; 2) 
chemo- therapeutic agents that induce DNA damage, such as 

25 cyclophosphamide, alkylating agents, purine analogs, 

e.g., 6-thioguanine, etc.; 3) drugs that interfere with 
DNA modifying enzymes, such as inhibitors of 
topoisomerase, DNA ligase exonucleases and endonucleases; 
and 4) agents that nonspecif ically enhance transcription, 

30 such as sodium butyrate, or agents that stabilize cells, 
such as DMSO. Also, genotoxic agents such as carcinogens 
may be employed as the conversion agent. Other methods 
of inducing disruption or damage to DNA may also be 
useful as agents capable of facilitating ss to ds 

35 conversion of rAAV and maybe selected by one of skill in 
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the art, including physical methods, such as irradiation. 
These classes of compounds or methods are believed to 
result in the conversion from ss to ds rAAV. 

According to this embodiment of the method of 
5 the invention, the rAAV is again produced conventionally, 
but not co- infected with a helper virus. The ss rAAV is 
infected into the target cell, and the infected cell is 
contacted by the agent in an appropriate manner depending 
on the identity of the agent. These conversion enhancing 
10 agents can be employed in ex vivo treatment of the target 
cells infected by the rAAV by application directly to the 
cells. Such application can occur substantially 
simultaneously, or consecutively, with application of the 
rAAV gene therapy vehicle. For example, the infected 
15 target cell may be subjected to one of the above-listed 
compounds or drugs for a desired time period. The 
parameters for contacting the infected cells with the 
agent may readily be determined by one of skill in the 
art. These parameters will depend upon whether the 
20 method is performed ex vivo or in vivo. For example, the 
number of ex vivo infected cells to be treated will be 
considered for the dosage, and timing of such treatment. 

Similarly, the physical status of the patient 
can determine the parameters of delivery of the agent to 
25 the patient in vivo. The dosage and amount of the 

damaging agent may therefore be adjusted by one of skill 
in the art. Where the agents are typical 
Chemother apeutic drugs approved for use in humans or 
animals, such enhanced conversion of rAAV may also occur 
30 in vivo by the co-administration of the agent, i.e., the 
chemotherapeutic drug, and the rAAV gene therapy vehicle 
to the patient. According to this aspect of the 
invention, the chemotherapeutic drug would be 
administered only when the rAAV is administered. 
35 Appropriate dosages and amounts of chemotherapeutic drugs 
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and rec mbinant gene 'therapy vehicles and m ans £ r 
determining such amounts are within the skill of the art. 
However, because the effect of the chemo therapeutic drug 
will enhance the ss to ds conversion of the rAAV and thus 
5 enhance its efficiency of transduction into the target 
cells, it is anticipated that lower dosages than the 
conventional dosages of either or both the drug and the 
rAAV could be effectively administered. 

C. Conversion Agent May be Part of the rAAV. 

10 In still another embodiment of this invention, 

a novel " second generation" rAAV may be designed to 
incorporate the conversion agent into the virus, so that 
both the transgene and the conversion agent are co- 
expressed in the target cell. Such a novel recombinant 

15 adeno-associated virus comprises the following 
components : 

(a) the DMA of at least a portion of the 
genome of an adeno- associated virus which portion is 
capable of transducing at least two selected genes or 

20 functional fragments thereof into a target cell in the 
absence of cell division; (b) a first selected gene, 
i.e., the desired transgene, operatively linked to 
regulatory sequences directing its expression, and (c) a 
second selected gene, i.e., the "conversion gene" 

25 operatively linked to regulatory sequences capable of 
directing expression of said second gene. The 
"conversion gene" upon expression is capable of 
facilitating the conversion of the ss rAAV to its ds form 
upon expression. The first and second genes in this rAAV 

30 are flanked by the AAV DNA, preferably the 5 • and 3 f 

ITRs. An embodiment of such a second generation rAAV is 
provided schematically in Fig. 11. Its DNA sequence is 
provided in SEQ ID NO: 5. 

Another embodiment of such a novel rAAV may 

35 include more than one gene which upon expression has the 
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ability to facilitate conversion of ss to ds rAAV in the 
target cell. For example, the novel rAAV described above 
may also contain an additional selected gene operatively 
linked to regulatory sequences capable of directing its 
5 expression, the additional gene and said second 

"conversion" gene described above being capable of 
jointly facilitating the conversion of ss rAAV to its ds 
form upon expression of both the second and additional 
genes* In this rAAV, all three genes, i.e., the 

10 transgene, the second "conversion" gene and the 
additional gene are flanked by the AAV DNA. 

In one desirable embodiment of a novel rAAVm 
the AAV ITRs flank a selected transgene, and a conversion 
gene, which is the adenovirus E4 gene or a functional 

15 fragment thereof (e.g w the 0RF6 sequence). In another 
embodiment, the novel recombinant expresses three genes, 
the transgene, the adenovirus E4 gene or a functional 
fragment thereof and the adenovirus El gene or a 
functional fragment thereof. The E4 and El gene products 

20 expressed in the target cell with the transgene, together 
act to facilitate conversion of the ss to ds form of 
rAAV. 

In still another embodiment of the novel rAAV 
and its use, the regulatory sequences directing 

25 expression of the conversion gene, e.g., whether it be a 
single second gene or more than a single additional gene, 
may include an inducible promoter. Thus, expression of 
the conversion gene occurs only in the presence of an 
inducing agent. Many inducible promoters and companion 

30 inducing agents, e.g., steroids such as glucocorticoids, 
are known to the art and may be readily selected for 
incorporation into the rAAV and methods of this invention 
by one of skill in the art with resort to this 
description. 

35 The method of the invention employing such 
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H second generation" rAAVs which carry at: least one 
"conversion gene" provides for infecting the target cell 
with this ss rAAV. Where the promoters directing 
expression of both the transgene and the conversion gene 
5 are constitutive, the infected target cell machinery will 
direct the expression of the transgene product and 
conversion gene product. Co-expression in the target 
cell of the transgene and the "conversion gene" 
facilitates the conversion of ss rAAV to ds rAAV in the 

10 cell, and increases the transduction efficiency, and 

perhaps stable chromosomal integration, without further 
method steps. 

When the second generation rAAV employed in the , 
method contains the "conversion gene(s)" under the 

15 control of inducible promoter ( s) , the method is slightly ; 
altered. Following infection of the target cell by the 
rAAV, the infected target cell is contacted with a 
suitable inducing agent, which triggers the inducible 
promoter to "turn on" production of the conversion gene 

20 product. When the inducing agent is removed or stopped, *> 
the expression of the conversion gene product is "turned : 
off". 

As described above, any prior art rAAV 
containing a transgene for gene therapy may be used in at 

25 least one embodiment of the above methods. The sources, 
selection and assembly of the various components to 
generate the rAAV, including the novel rAAV described 
above, are now conventional and readily accessible to one 
of skill in thisart, given the disclosure contained 

30 herein. Such methods employ conventional genetic 

engineering techniques [See, e.g. Sambrook et al, cited 
above] . 

The novel rAAV viruses and the methods of this 
invention provide efficient gene transfer vehicl s for 
35 somatic gene therapy and are suitable in pharmaceutical 
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compositions for x vivo applications and in vivo use. 
When rAAV contain a therapeutic gene, e.g., in place of 
the LacZ transgene illustrated in the exemplary rAAV, 
AV.CMVIracZ, by use of the rAAV and the methods described 
5 herein, the therapeutic transgene can be delivered to a 
patient in vivo or ex vivo to provide for efficient 
transduction, and possibly stable integration, of the 
desired gene into the target cell. Thus, these novel 
rAAV and the methods described herein can be employed to 

10 correct genetic deficiencies or defects. The potential 
of AAV to efficiently integrate its genome into 
nondividing cells is currently being exploited in the 
development of gene therapies based on ex vivo 
transduction of hematopoietic stem cells. In vivo 

15 application of rAAV is primarily being developed for the 
treatment of CF where purified stocks of virus are 
instilled into the airway to transduce the terminally 
differentiated epithelial cells of conducting airway. 
The methods and compositions described herein can be used 

20 with both types of gene therapy. Another condition 

suitable for such use includes transduction of the low 
density lipoprotein (LDL) receptor gene into hepatocytes 
for the treatment of familial hypercholesterolemia. One 
of skill in the art can generate any number of rAAV which 

25 can be used via the above methods for the treatment of 
these and other disorders. 

For ex vivo or for in vivo therapy, the rAAV 
may be used to infect the target cells by suspending the 
virus particles in a biologically compatible solution or 

30 pharmaceutically acceptable delivery vehicle. A suitable 
vehicle includes sterile saline. Other aqueous and non- 
aqueous isotonic sterile injection solutions and aqueous 
and non- aqueous sterile suspensions known to be 
pharmaceutically acceptable carriers and well known to 
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those of skill in the art may be employed for this 
purpose . 

The rAAV are administered in sufficient amounts 
to transfect the desired cells and provide sufficient 
5 levels of expression of the selected transgene to provide 
a therapeutic benefit without undue adverse, or with 
medically acceptable, physiological effects which can be 
determined by those skilled in the medical arts. 
Conventional and pharmaceutically acceptable routes of in 

10 vivo administration include direct delivery to the target 
organ, tissue or site, intranasal, intravenous, 
intramuscular, subcutaneous, intradermal, oral and other 
parental routes of administration. Routes of 
administration may be combined, if desired. 

15 Dosages of the rAAV for the infecting step of 

the method will depend primarily on factors such as the 
therapeutic environment, i.e., ex vivo or in vivo; the 
condition being treated, the selected gene, the age, 
weight and health of the patient, and may thus vary among 

20 patients. A therapeutically effective dosage of the rAAV 
for ex vivo treatment will be based upon the multiplicity 
of infection, which is likely to range from between about 
1 to about 10 transducing particles/cell. A 
therapeutically effective human dosage of the rAAV for in 

25 vivo infection according to the present invention is 

believed to be in the range of from about 20 to about 50 
ml of saline solution containing concentrations of from 
about 1 x 10 7 to 1 x 10 10 transducing viral particles/ml 
virus. A preferred human dosage is about 20 ml saline 

30 solution at the above concentrations. The dosage will be 
adjusted to balance the therapeutic benefit against any 
side effects. The levels of expression of the selected 
gene can be monitored to determine the selection, 
adjustment or frequency of dosage administration. 
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Th effective amount of th facilitating agent 
to be administered is within the skill of the art to 
determine and will depend upon the identity of the agent. 
Known dosages of certain of the classes of chemicals and 
5 pharmaceuticals described above may be employed in this 
method to damage the DNA and facilitate ss to ds 
conversion of the rAAV. Where the agent is a gene 
expressed by a helper virus, the amounts of infecting 
virus should be similar to those amounts described above 

10 for the rAAV. Of course, where the agent is a gene 
present in a second generation rAAV, the identical 
dosages described above for the rAAV will apply. 

Several embodiments of the above-described 
methods of this invention were confirmed in murine models 

15 of rAAV mediated gene transfer to both lung and liver. 
These experiments demonstrated similarly low levels of 
gene transfer in vivo by rAAV, which was increased 
several orders of magnitude by coinfection with El and E4 
expressing adenovirus. 

20 In summary, experiments were conducted to 

demonstrate that adenovirus enhances rAAV transduction in 
cultured cells. During the production and 
characterization of a lacZ recombinant AAV generated in 
293 cells that were co infected with an El deleted virus, 

25 it was observed that purification of rAAV from lysates 

was associated with substantial loss of lacZ transducing 
activity when assayed on 293 cells. This drop in rAAV 
activity was particularly evident in the final step where 
residual contaminating helper adenovirus was removed by 

30 heat inactivation. LacZ transducing activity was 

recovered by adding adenovirus back to the purified stock 
of rAAV. These data provided the first indication that 
adenovirus could substantially enhance the transduction 
fficiency of rAAV. 
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As described in Example 10, a series of 
complementation groups were generated by mixing different 
adenovirus early gene mutants with purified LacZ rAAV, 
referred to as AV.CMVLacZ (see Example 2) . These defined 
5 mixtures of viruses were analyzed for La.cZ transduction 
on Hela cells (See Examples 12 and 13) . An El deletion 
rAd H5.CBALP and the E4 deletion mutant dll004 provided 
no significant increase in AV.CMVLacZ transduction (Fig. 
5A) . However, partial activity could be achieved with El 

10 and E4 mutants that carried less severe deletions. Both 
dlllO (ElB-55kDa deleted) and dllOlO (0RF6 deleted) 
enhanced transduction to levels that approached those of 
Ad5, tsl25, and dl802 in terms of the number of positive 
blue cells, but total 6-galactosidase activity was 

15 substantially lower (Fig. 5A) . These results implicate 
early regions El and E4 in the augmentation of rAAV 
transduction . 

The experiments described below also 
demonstrate that the novel rAAV which incorporates as its 

20 conversion gene, an Ad gene, such as E4, can increase 

transduction efficiency of the rAAV in the absence of a 
helper virus. As described in more detail below in 
Example 15 below, 293 cells were stably transfected with 
a genomic fragment of Ad5 spanning E4. This E1/E4 

25 expressing cell line and the parent El expressing cell 
line (293) were infected with rAAV and analyzed for 
transduction. These experiments demonstrated the 
significance of the combined expression of El and 
E4(ORF6) in the adenovirus mediated augmentation of rAAV 

30 transduction. 

In the presence of El and E4 expression, rAAV 
transduction was invariably accompanied by the appearance 
of ds RF monomers and dimers (Example 14) • Importantly, 
the tight correlation between rAAV vector transduction 

35 and the accumulation of duplex forms could be achiev d in 
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two differ nt exp rimental settings; cells infect d with 
E1/E4 expressing adenovirus (Figs. 8A and 8B) , or 
complementing cell lines (Fig* 8C) . 

The following examples illustrate the 
5 construction and testing of the novel packaging cell 

lines, the E1/E4 deleted rAd of the present invention and 
the use thereof, improved methods and second generation 
recombinant AAV production for gene therapy of the 
present invention. These examples are illustrative only, 
10 and do not limit the scope of the present invention. 

Example 1 - Novel Ela/Elb and E4 Expressing Packaging 
Cell Lines 

A. Construction of E4 ORF 6 Expressing Plasmids 
The entire E4 region from Ad5 or an ORF6 

15 minigene were subcloned into a shuttle plasmid that 

contained a neomycin resistance gene. Two versions of 
ORF6 minigene were developed that differed in the 
promoter element. The first used a Zn+2 inducible sheep 
metallothionine (MT) promoter to drive ORF 6 expression. 

20 The second used a dexamethasone-inducible mouse mammary 
tumor virus (MHTV) promoter. 

An exemplary plasmid useful for the 
construction of a packaging cell line of this invention 
is pMMTVE40RF6 • The minigene contained in this plasmid 

25 is set out in SEQ ID NO: 1, and contains a mouse mammary 
tumor virus promoter (MMTV) (nucleotides 1-1506 of SEQ ID 
NO:l) in transcriptional control of a human E4 ORF 6 gene 
sequence (nucleotides 1523-2408 of SEQ ID NO: 1) , a 
growth hormone terminator (GH) (nucleotides 2409-3654 of 

30 SEQ ID NO: 1) , an SV40 origin of replication, plasmid 
sequences from plasmid pBR322, including a neomycin 
resistance gene, and an ampicillin resistance gene. The 
amino acid sequence of ORF 6 is indicated in SEQ ID NO: 
2* The various functional fragments of this plasmid may 

35 be readily replaced with other conventionally used 
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s quences and are n t critical to the design of the 
plasmid. 

Another plasmid useful for the construction of 
a packaging cell line of this invention is pMTE40RF6. 
5 The DNA sequence of the minigene contained in this 

plasmid is similar to that of SEQ ID NO: 1, except that 
the promoter is a sheep metallothionine promoter (MT 
promoter) [H. G. Peterson et al, cited above]. 

A plasmid used as a control for the 
10 construction of a packaging cell line of this invention 
is pLTR.E4(-). This plasmid contains the endogenous 
constitutive retroviral MLV LTR and most of the Ad E4 
gene region except that the endogenous E4 promoter and a 
portion of E4 ORF1 are missing. The other plasmid 
15 sequences remain the same as described above. 

Still another plasmid useful for the study of ^ 
the methods of this invention is pLTR.E4, which contains ^ 
the constitutive MLV LTR and endogenous E4 promoter and ; 
an intact E4 gene. The other plasmid sequences remain 
20 the same as described above. ?.y 

To determine whether ORF6 expression was 
sufficient to enhance rAAV transduction, the inducible 
metallothionein (MT)-ORF6 minigene was stably transfected 
into HeLa cells. This new cell line, HeLa (MT-ORF6) was 
25 evaluated for LacZ rAAV transduction in response to ORF6 
induction as described below. The cell line 293 
(MT-ORF6) expresses ORF-6 of the E4 gene of Ad5 from the 
metallothionine promoter which is relatively inactive at 
baseline but can be induced with divalent cations. These 
30 293 cells were included to establish the baseline 
transduction efficiency. 

B. Transactions and Selection of Clones 

Each of the above-described plasmids was 
transfected by the calcium phosphate precipitati n 
35 technique into the human embryonic kidney cell line 293 
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[ATCC CRL1573] which expresses the product of the 
adenovirus El genes, or into HeLa cells, seeded on 100 mm 
plates (10 fig plasmid/plate) . Twenty f our hours post- 
transf ection, cells were harvested and seeded at varying 
5 dilutions (1:10 - 1:100) in 100 mm plates for about 10 
days. Seeding media contain G418 (Geneticin, BRL) at 1 
mg/ml. Resistant colonies that developed were selected 
using the following assays and expanded. Preliminary 
analysis of clones was based on enhanced transduction 

10 efficiency of a recombinant adeno-associated virus, 

AV.CMVLacZ, and immunofluorescence localization of Ad E4 
protein as described in the following examples. 
Example 2 - Recnmhinan t AAV and XV.CMBLacZ Transduction 
Enhancement Assay 

15 El and E4 Ad gene products are needed for 

recombinant adeno -associated virus (AAV) function. This 
primary assay involves seeding the packaging cell lines 
of Example 1 in 96 well 35 mm culture plates (2xl0 6 
cells/well) and infecting the cells with purified, heat- 

20 treated AV.CMVLacS at an MOI of 1000 virus 
particles/cell. 

A. Preparation of Recombinant AV.CMVLacZ 

A recombinant AAV virus was prepared by 
conventional genetic engineering techniques for the 

25 purposes of this experiment. Recombinant AAV was 

generated by plasmid trans feet ions in the presence of 
helper adenovirus [Samulski et al, J. Virol. . 63:3822- 
3828 (1989)]. A cis-acting plasmid pAV.CMVLacZ [SEQ ID 
NO: 4] (see Fig. 10) was derived from psub201 [Samulski 

30 et al, J. Virol, - 61:3096-3101 (1987)] and contains an E. 
coll 6-galactosidase minigene in place of AAV jRep and Cap 
genes. The 5 • to 3 1 organization of the recombinant 
AV.CMVI/acZ genome (4.9 kb) [SEQ ID NO: 4] includes 

(a) the 5* AAV ITR (bp 1-173) was 

35 obtained by PCR using pAV2 [C A. Laughlin et al, Gene . 



WO 96/39530 



PCT/US96/10245 



39 

23 ; 65-73 (1983)] as template [nuc. 53-219]; 

(b) a CMV immediate early 
enhancer /promoter [Boshart et al. Cell . 41x521-530 
(1985)] (nuc. 246-839); 
5 (c) an SV40 intron (nuc. 856-987) ; 

(d) E. coli fi-galactosidase cDNA (nuc. 

1039-4512); 

(e) an SV40 polyadenylation signal (a 237 
Bam HI-BclI restriction fragment containing the 

10 cleavage/poly-A signals from both the early and late 
transcription units (nuc. 4522-4719) and 

(f) 3 'AAV ITR, obtained from pAV2 as a 
SnaBI-Bglll fragment (nuc. 4759-4925) . All other 
nucleotides are plasmid derived. 

15 Rep and Cap genes were provided by a trans- 

acting plasmid pAAV/Ad [Samulski et al, cited above]. 

Monolayers of 293 cells grown to 90% confluency 
in 150 mm culture dishes (5xl0 7 cells/plate) were 
infected with H5.CBALP at an MOI of 10. H5 • CBALP (also 

20 called H5.010ALP) is a rAd that contains an alkaline 

phosphatase minigene in place of adenovirus Ela and Elb 
gene sequences (map units 1-9.2 of the Ad5 sequence of 
GenBank [Accession No. M73260]). The alkaline 
phosphatase cDNA is under the transcriptional control of 

25 a CMV-enhanced B-actin promoter in this virus. This 
helper virus is described in Goldman et al. Hum . Gene 
Ther. * 6:839-851 (July, 1995); Engelhardt et al, Hum. 
Gene Ther . . 5:1217-1229 (October, 1994); and references 
cited therein. 

3 0 Infections were done in Dulbecco's Modified 

Eagles Media (DMEM) supplemented with 2% fetal bovine 
serum (FBS) at 20 ml media/ 150 mm plate. Two hours post- 
infection, 50 pq plasmid DNA (37.5 nq trans-acting and 
12.5 cis-acting) in 2.5 ml of transfection cocktail 

35 was added to each plate and evenly distributed. 
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Transf ections were calcium phosphate based as described 
[B. Cullen, Meth. Enzvaoh . 152 :684-704 (1987)]. Cells 
were left in this condition for 10-14 hours after which 
the infection/ transf ect ion media was replaced with 20 ml 
5 fresh DMEM/2% FBS . Forty to fifty hours post- 

transf ect ion, cells were harvested, suspended in 10 mM 
Tris-Cl (pH 8.0) buffer (0.5 ml/150 mm plate) and a 
lysate prepared by sonication. The lysate was brought to 
10 mM manganese chloride, after which bovine pancreatic 

10 DNase I (20,000 units) and RNase (0.2 mg/ml final 

concentration) were added, and the reaction incubated at 
37 °C for 30 minutes. Sodium deoxycholate was added to a 
final concentration of 1% and incubated at 37 °C for an 
additional 10 minutes. 

15 The treated lysate was chilled on ice for 10 

minutes and solid CsCl added to a final density of 1.3 
g/ml. The lysate was brought to a final volume of 60 ml 
with 1.3 g/ml CsCl solution in 10 % mM Tris-Cl (pH 8.0) and 
divided into three equal aliquots. Each 20 ml sample was 

20 layered onto a CsCl step gradient composed of two 9.0 ml 
tiers with densities 1.45 g/ml and 1.60 g/ml. 

Centrifugation was performed at 25,000 rpm in a 
Beckman SW-28 rotor for 24 hours at 4°C. One ml 
fractions were collected from the bottom of the tube and 

25 analyzed on 293 or 293 (E4) cells for La.cZ transduction. 

Fractions containing peak titers of functional AV.CMViac^ 
virus were combined and subjected to three sequential 
rounds of equilibrium sedimentation in CsCl. Rotor 
selection included a Beckman NVT-90 (80,000 rpm for 4 

30 hours) and SW-41 (35,000 rpm for 20 hours). At 

equilibrium, AV.CMVLacZ appeared as an opalescent band at 
1.40-1.41 g/ml CsCl. Densities were calculated from 
refractive index measurements. Purified vector was 
exchanged to 20 mM HEPES buffer (pH7.8) containing 150 mM 

35 NaCl (HBS) by dialysis and stored frozen at -80°C in the 



WO 96/39530 



PCT/US96/10245 



41 

presence of 10% glycerol or as a liquid stock at -20°C in 
HBS/40% glycerol. 

Purified virus was tested for contaminating 
H5.CBALP helper virus and AV.CMVLacZ titers. Helper 
5 virus was monitored by histochemical staining for 

reporter alkaline phosphatase activity. A sample of 
purified virus representing 1.0% of the final product was 
added to a growing monolayer of 293 cells seeded in a 60 
mm plate. Forty-eight hours later, cells were fixed in 

10 0.5% glutaraldehyde/phosphate buffered saline (PBS) for 
10 minutes at room temperature, washed in PBS (3x10 
minutes) and incubated at 65 °C for 40 minutes to 
inactivate endogenous alkaline phosphatase activity. The 
monolayer was allowed to cool to room temperature, rinsed 

15 once briefly in 100 mM Tris-Cl (pH9.5)/100 mM NaCl/5mM 
MgCl, and incubated at 37 °c for 30 minutes in the same 
buffer containing 0.33 mg/ml nitroblue tetrazolium 
chloride (NBT) and 0.165 mg/ml 5-bromo-4-chloro-3- 
indolylphosphate p-toluidine salt (BCIP) . Color 

20 development was stopped by washing the monolayer in 10 mM 
Tris-Cl (pH 8.0)/5 mM EDTA. Routinely the purification 
scheme described above removed all detectable H5.CBALP 
helper virus by the third round of buoyant density 
ultracentr if ugation . 

25 AV.CMViacZ titers were measured according to 

genome copy number (virus particles /ml) , absorbance at 
260 nm (A^ particles/ml) and LacZ Forming Units 
(LFU/ml) . Virus particle concentrations were based on 
Southern blotting. Briefly, a sample of purified 

3 0 AV.CMVLacZ was treated with capsid digestion buffer (50 
mM Tris-Cl, pH 8.0/1.0 mM EDTA, pH 8.0/0.5% 
SDS/ Proteinase K 1.0 mg/ml) at 50°C for one hour to 
release virus DNA. The reactions were allowed to cool to 
room temperature, loading dye was added and 

35 electrophoresed through a 1.2% agarose gel. Standard 
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quantities of ds AV.CtNLacZ genome were also resolved on 
the gel. 

DNAs were electroblotted onto a nylon membrane, 
hybridized with a 32 P random primer labeled restriction 
5 fragment, and the resulting blot scanned on a 

Phosphor Imager 445 SI (Molecular Dynamics) . A standard 
curve was generated from the duplex forms and used to 
extrapolate the number of virus genomes in the sample. 
LFU titers were generated by infecting indicator cells 

10 with limiting dilutions of virus sample. Indicator cells 
included HeLa and 293 and 293 (£4) lines (described in 
Example 10 below) . Twenty-four hours later, cells were 
fixed in glutar aldehyde and cells were histochemically 
stained for E • coll £-galactosidase (LacZ) activity as 

15 described in J. M. Wilson et al, Proc. Natl. Acad. Sci. 
USA . £5:3014-3018 (1988). One LFU is described as the 
quantity of virus that is sufficient to cause visually 
detectable B-galactosidase expression in one cell 24 
hours post- infect ion. 

20 B. Induction of ORF6 Expression 

Induction of ORF6 expression with 10 MM 
dexamethasone or 150/xM zinc sulfate (for negative 
control, no inducer used) was initiated 2 hours before 
the addition of virus and continued throughout the 

25 duration of the experiment. Twenty-four hours after the 
addition of virus, cells were harvested, lysates were 
generated by sonication and analyzed for the 6- 
galactosidase expression (i.e., B-galactosidase activity) 
and virus DNA as described above. Hirt extracts were 

30 prepared from low molecular weight DNA from cell 

extracts. The preparation of the Hirt extracts and 
subsequent analysis by Southern hybridization were 
performed by resort to conventional procedures known to 
one of skill in the art. 

35 In the absence of the inducers, th packaging 
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cell lines generate lower levels of B-galactosidase in 
rAAV infected cells. Induction of ORF6 expression with 
the inducer dexamethasone results in a concomitant rise 
in AV.CMVXacZ cell transduction to a level that was much 
5 greater than the parent 293 line. Expression of El alone 
was insufficient to have an effect in the adenovirus 
mediated augmentation of rAAV transduction. 

Results are demonstrated for certain positive 
clones in the Table I below (see Example 4) . However, 

10 for 30 cell lines having an MMTV promoter and ORF6 

sequence, 4 demonstrated over 90% blue cells illustrative 
of LacZ production in the presence of dexamethasone, 
i.e., 293-27-6, 293-27-17, 293-27-18 and 293-27-28. 
Example - Timnunof luorescence Localization of A d5 Late 

15 Protein. 

Positive clones from the assay of Example 2 were 
infected with the recombinant E4 deleted adenovirus 
H5dll004 and screened for E4 complementation using an 
immunofluorescence assay for late gene expression. The 

20 H5dll004 virus was obtained from Dr. Ketner of Johns 

Hopkins University and is described in Bridge and Ketner, 
J. Virol. . 632 (2) :631-638 (Feb. 1989), incorporated by 
reference herein. Because ORF6 of E4 complements late Ad 
gene expression, specifically in the formation of the 

25 hexon and penton fibers of the adenovirus, cell lines 
containing ORF6 are able to bind with antibody against 
these proteins. 

Each cell line of Example 1 is infected with E4 
deleted virus H5dll004 virus at an MOI of 0.1. The cells 

30 were treated with mouse anti-adenovirus FITC-labeled 

monoclonal antibody to either the hexon or penton fibers 
in a 1:10 dilution (Chemicon International Inc., 
Temecula, CA) . Positive clones were identified by 
reaction with the antibody. 
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Example 4 - Relative Plaguing Efficiency 

The cell lines of Example 1 demonstrating with 
strong complementation ability in Example 3 were screened 
for relative plaquing efficiency of H5dll004 as compared 
5 to W162 cells (an E4 -complementing Vero cell line which 
does not express El) [Weinberg and Ketner, Proc. Natl, 
Acad, Sci. USA . 80 (17) : 5383-5386 (1983)]- In Table II 
below, RPE%, i.e., relative plaquing efficiency, 
represents the titer of H5dll004 on tested cell 

10 lines/titer of H5dll004 on W162 cells. For example, the 
RPE of 293 cells is 0. 

The positive cell lines selected by all criteria are 
identified in Table I below, with the results of the 
assays of Examples 2 , 3 and 4 . 

15 TABLE I 



E1/E4 Double Complementing Cell Lines 



Cell 


Trans- 


Pro- 




AV.CMV 




Line 


Gene 


moter 


IF/LP 


LacZ 


RPE% 


293-10-3 


ORF6 


MT 


++++ 


++++ 


246 


293-39-11 


ORF6 


LTR 


++++ 


+++ 


52 


293-84-31 


E4- 


LTR 


++++ 


++++ 


179 


293-12-31 


whole 


LTR 


++++ 


++++ 


174 




E4 


+E4 








293-27-6 


ORF6 


MMTV 




+++++ 


327 


293-27-17 


ORF6 


MMTV 




++++ 


313 


293-27-18 


ORF6 


MMTV 




+++++ 


339 


293-27-28 


0RF6 


MMTV 




++++ 


261 



Example 5 - Construction and Purification of H5 . OOlCBLacZ 
30 The plasmid pAd.CBLacZ was constructed as described 

in detail in K. Kozarsky et al, Som. Cell Mol. Genet. . 
19(5): 449-458 (1993), incorporated by reference herein. 
This plasmid contained a minigene comprising a 5 1 
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flanking Nhel restriction site, followed by Ad 5 sequence 
m.u. 0-1, followed by an El deletion into which is 
inserted a CMV enhancer /chicken 6-actin promoter sequence 
[T. A. Kost et al, Nucl. Acids Res. . 11(23) :8287 (1983)], 
5 which controls the transcription of the following 

bacterial fi-galactosidase, followed by a poly A sequence 
and flanked 3 1 by Ad m.u. 9-16, and another Nhel site. 
In the plasmid, the minigene was flanked on both sides by 
plasmid sequence containing drug resistance markers. 

10 The plasmid pAd.CBLacZ was linearized with Nhel and 

co-transf ected by the calcium phosphate co-transf ection 
method into the novel packaging cell line of Example 1 
with Clal digested H5dll004 (an Ad 5 sequence deleted of 
from about map unit 92.1 through map unit 98, 

15 corresponding to substantially the entire E4 gene) • 

Homologous recombination occurs in the cell line 
between these two viral constructs between Ad map units 
9-16, resulting in rAd, designated H5. OOlCBLacZ [SEQ ID 
NO: 3] (Fig. 2). This rAd contains the sequence from 

20 pAd.CB.LacZ (including Ad map units 0-1 (nuc. 1-330); CMV 
enhancer/ chicken 6-actin promoter (CB) (nucs. 370-928) ; 
E * coll B-galactosidase (nucs. 945-4429) ; the polyA (nuc. 
4429-4628); and Ad5 map units 9-92.1 and 97.3 to 100 from 
H5dll004 (nucs. 4671-35408)). This rAd is thereby 

25 functionally deleted, and substantially structurally 
deleted, of the Ad El and E4 genes. 

Viral plaques were selected and screened by the fi- 
galactosidase assay [Wilson (1988), cited above] and 
H5. OOlCBLacZ was isolated following three rounds of 

30 plaque purification. The purified virus was also 

subjected to cesium chloride density centrif ugation and 
large scale production. 

For the following mouse experiments, virus was used 
after column purification and glycerol was added to a 
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final concentration of 10% (v/v) . Virus was stored at - 
70 °C until use. 

Example 6 - Growth Kinetics of H5 . OOlCBLacg in Packaging 
Cell Lines 

5 The cell lines identified in Table I were infected 

with recombinant H5 . OOlCBJDacZ at an MOI of .5. The 
growth kinetics of this virus in the E4 complementing 
cell lines are shown in Fig. 3. Maximum viral yield is 
reported as LFU/ml in Table II below. 
10 TABLE II 



Cell Line 


Maximum Viral 


293-10-3 


2.8X10 10 


293-39-11 


9.5X10 8 


293-84-31 


1.1X10 9 


293-12-31 


4.5x10 s 


293-27-6 


2.8X10 10 


293-27-17 


2 . 5X10 10 


293-27-18 


2.9X10 10 


293-27-28 


1. 2X10 10 



20 When grown in 293-27-18 cells (the E4 ORF6 cell line 

with MMTV promoter inducible by dexamethasone) the 
maximum yield of this virus is 2.9xl0 10 LFU/ml. Several 
of the cell lines were passaged between 5 and 20 times 
and the viral production of the passages remained stable. 

25 However, RPE did fall following repeated passages of 
cells. 

Example 7 - Other Recombinant Adenoviruses 

Other related rAds were prepared similarly to 
H5 . OOlCBLacZ by homologous recombination between 
3 0 pAdCBLa c£T and other helper viruses. 

As one example, H5. QOOCBLacZ is a recombinant El 
deleted Ad5 which contains the same minigene as 
H5 . OOlCBLacZ, but has an intact E4 gene. This rAd was 
prepared as described by homologous recombination between 
3 5 pAdCBLacZ and a wild-type Ad5. 
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As another example, H5 . OXOC&LacZ contains the 
adenovirus map units 0-1, followed by a CMV enhanced, 
chicken cytoplasmic /?-actin promoter, the E. coll B- 
galactosidase gene (lacZ) , a polyadenylation signal (pA) , 
5 and adenovirus type 5 map units 9-100, with a small 

deletion in the E3 gene (the Ad 5 sub3 60 backbone) . This 
rAd may be prepared by homologous recombination between 
the pAdCBiac^ vector and Ad5 virus sub360, which contains 
a 150 bp deletion within the 14.6 kD protein of the E3 
10 gene. See, e.g. , J. F. Enaelhardt et al . Proc . Natl . 

Acad. Sci. . USA . 91:6196-6200 (June 1994); and Engelhardt 
et al, Hum. Gene Ther. , 5:1217-1229 (Oct. 1994), both 
incorporated by reference herein. 

These rAds were isolated following transfection 
15 [Graham, Virol. . 152:456-467 (1974)], and were subjected 
to two rounds of plaque purification. Lysates were 
purified by cesium chloride density centrif ugation as 
previously described [Englehardt et al, Proc . Nat 1 . Acad . 
Sci. USA . 88:11192-11196 (1991)]. Cesium chloride was 
20 removed by passing the virus over a BioRad DG10 column 
using phosphate-buffered saline. 
Example 8 - LacZ Gene Transfer into Mouse 
A. Transfer Into Mouse Muscle 

Five to six-week old male C57B/6 mice were 

2 5 anesthetized. Anterior tibialis muscles were exposed and 

directly injected with either rAd H5 . 000CBI*ac£, 
H5. QlOCBLacZ or H5 . OOlCBiac^ as follows: 25 /iL of 
purified viral suspension at a stock concentration of 5 x 
10 11 virus particles /mL was injected by inserting the tip 

3 0 of the 3 3 gauge needle of a 100 filj Hamilton syringe into 

the belly of the muscle. 

Animals were sacrificed on day 4, 14, 28 and 60 
post injection. The muscles were dissected and frozen in 
liguid nitrogen cooled isopentane. Six mM sections were 
3 5 cut in a cryostat, fixed and stained for B-galactosidase 
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activity for 6 hours at 37°C. 

While the blue stained rAd was found for each 
virus in the day 4 and day 14 (most abundant) stains, by 
day 28, the HS.OOlCBLacZ clearly demonstrated more virus 
5 on day 28. By day 60 , the only virus which stained 
positive was the H5 . 001CB£ac£. 

B. Transfer into Mouse Lung and Circulation 
RAd HS.OOOCBLacZ (control), and HS.OOlCBLacZ 

(IxlO 11 viral particles) were administered to six week old 
10 C57BL/6 female mice by tail vein injection and trachea 

installation. The animals were sacrificed and their 

liver and lung tissues were harvested at days 4, 9, 21, 

28 and 35 post-administration. The transgene and viral 

late gene expression were compared. 
15 At therapeutic doses of virus, there was 

diminished expression of late viral proteins at all time 

points in comparison with transgene. 

C. Dose Responses in Liver 

Dose responses of E4 -deleted and E4 intact rAds 

2 0 in the liver of C57BL/6 mice were studied by tail vein 

administration of 1.5X10 11 , 5xl0 10 , 1.7xl0 10 , 5.6xl0 9 , and 
1.9X10 9 viral particles and comparing the transgene and 
viral late gene expression at day 4, 21, 28, 35, and 42 
post administration. 
25 At therapeutic doses of virus, there was 

diminished expression of late viral proteins at all time 
points in comparison with transgene. 
Example 9 - Other Gene Transfers 
A. Human OTC Gene Transfer 

3 0 The human OTC gene [A. L. Horwich et al, 

Science. 224:1068-174 (1984)] or the human CFTR gene 
[Riordan et al, Science . 245:1066-1073 (1989)] was used 
to replace the LacZ as the transgene in the recombinant 
E1/E4 deleted adenoviruses described above, using the 
35 techniques analogous for the construction of the above- 
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described LacZ vectors. 

The resulting human OTC-containing rAd were 
administered at an MOI of 10 to 30 to human hepatocytes. 
The E1/E4 deleted rAd demonstrated less replication and 
5 less late gene expression than when the E1/E4 deleted 
rAds are administered to muscle/ as described in the 
example above. However, the results of this gene 
transfer are better than comparable transfers with rAds 
containing only a deletion in the El gene or a deletion 
10 in the El gene and a point mutation in the E2a gene._ 

Similar results are demonstrated when the 
transgene is CFTR and the method of administration is 
intratracheal into lungs. 

Example 10 - Transduction Efficiency of rAAV LacZ 
15 AV.CMV.Lac^ in HeLa Cells Infected with Ad Mutants 
A. Viruses 

The following viruses were employed in this 
experiment: 

(1) Wild-type Ad 5, propagated in 293 cells; 
20 (2) Ad dlllO (an Ad which is deleted of the 55 

kb E1B gene) [Babiss et al, J. Virol. . 52 (2) : 389-395 
(1984) and Babiss and Ginsberg, J. Virol. , 50 (1) : 202-212 
(1984)], propagated in 293 cells, 

(3) H5.CBALP (an Ad deleted of its E1A and E1B 
25 genes and containing a minigene that expresses alkaline 

phosphatase from a CMV enhanced /?-actin promoter, as 
described above) , propagated in 293 cells, 

(4) Ad tsl2 5 (an Ad with a temperature 
sensitive mutation in the E2A gene which encodes the DNA 

30 binding protein) [Ensinger and Ginsberg, J. Virol. . 
10 (3) :328-339 (1972)], propagated in 293 cells, 

(5) Ad dl802 (an Ad deleted of its E2a gene) , 
grown in E2A-complementing gmDBP cells as described in 
Rice and Klessig, J. Virol. , 56 (3 ): 767-778 (1985); 
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(6) Ad dll004 (an Ad deleted of the E4 gene) , 
grown in E4 -complementing Vero W162 cells [Weinberg and 
Ketner, Proc. Natl. Acad. Sci. USA , 80(17) :5383-5386 
(1983) ] and 

5 (7) Ad dllOlO (an Ad deleted of ORF6 of its E4 

gene) , grown in E4 -complementing Vero W162 cells 
[Weinberg and Ketner, cited above] . 

All viruses were purified by two sequential 
rounds of buoyant density ultracentrif ugation in CsCl. 

10 B. Experimental Procedures 

HeLa cells seeded in 6 well, 36mm culture 
plates (2xl0 6 cells/well) were infected with wild-type 
Ad 5 or an adenovirus early gene mutant as described in 
Part A at an MOI of lOpfu/well. Infections were done in 

15 1.0 ml DHEM/ 2 % FBS • Six hours post-infection, monolayers 
were washed and 1.0 ml fresh DMEM/2% FBS media containing 
AV.CMVLacZ at 4xl0 9 virus particles/ml were added. 
Although the AV.CMV.LacZ virus lot used in these 
experiments was shown to be free of H5.CLALP helper virus 

20 by histochemical staining, the virus sample was subjected 
to heat treatment (60°C for 20 minutes) prior to use to 
ensure the absence of contaminating adenovirus. Two 
hours later, 1.0 ml of DMEM/115% FBS was added to each 
well. 

25 Twenty-four hours after the addition of 

AV.GMVLacZ, cells were harvested. Each test condition 
was done in triplicate to enable virus transduction to be 
evaluated in terms of three outputs: histochemical 
staining for fi-galactosidase activity (below) , 

30 intracellular B-galactosidase specific activity (Example 

11) , and the molecular form of the virus DNA (Example 

12) . 

HeLa cells were histochemically stained for E. 
coli £-galactosidase (LacZ) activity as described in J. 
35 M. Wilson et al, Proc. Natl. Acad. Sci. USA , 85:3014-3018 
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(1988) . The different combinations that w re tested 
included cells transfected with AAV vector alone 
(AV.CMVI^cZ) , vector plus wild-type Ad5 (+Ad5) , vector 
plus dlllO (+dlllO) , vector plus Ad mutant H5.CBALP 
5 (+H5.CBAI*P) , vector plus Ad mutant tsl25 (+tsl25) , vector 
plus Ad mutant dl802 (+dl802) , vector plus Ad mutant 
dll004 (+dll004) , and vector plus Ad mutant dllOlO 
(+dll010) • 

The results were observed in photomicrographs 

10 at magnification 10X (not pictured) of histochemical 
stains for recombinant B-galactosidase activity. The 
results indicated that wild-type Ad5 and the E2a mutants 
tsl25 and dl802 caused a significant increase in LacZ 
rAAV transduction as measured by the number of positive 

15 blue cells and the degree of stain intensity. Both dlllO 
(ElB-55kDa) and dllOlO (ORF6) enhanced transduction to 
levels that approached those of Ad5, ts!25, and dl802 in 
terms of the number of positive blue cells. 

The El deletion recombinant H5.CBALP provided 

20 no significant increase in AV.CMVLac£ transduction. 

Expression of El alone was insufficient to have an effect 
in the adenovirus mediated augmentation of rAAV 
transduction as evidenced by lack of significant increase 
in transduction obtained with HeLa cells infected with 

25 the E4 deletion mutant dll004. A significant drop in 

transduction occurred following removal of ORF6 from the 
E4 region from the coinfecting adenovirus (Fig. 5A) . 

It is believed that these results demonstrate 
that the adenoviral gene products, E4 and El indirectly 

30 promote the formation of ds DNA intermediates that are 
transcriptionally active. 

Example 11 - Quantitation of Enhanced Vect or Transduction 

(A) A duplicate set of HeLa cells as described in 
Example 10B were used in this xperiment. Twenty-f ur 
35 hours after the addition of AV.CMVLacZ recombinant, for 
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intracellular B-galactosidase assays, cell pellets were 
suspended in 0.5 ml PBS and sonicated. Cell debris was 
removed by centrifugation (15,000Xg for 10 minutes) and 
the clarified extract assayed for total protein [M. 
5 Bradford, Anal. Biochem. , 72 (1-2) :248-254 (1976) and M. 
Bradford et al, Fed. Proc. . 35(3) :274 (1976)] and B- 
galactosidase activity [Sambrook et al, cited above] 
using o-nitrophenyl B-D-galactopyranoside (ONPG) as 
substrate . 

10 Fig. 5A demonstrates the transduction 

efficiency quant it ated by measuring B-galactosidase 
enzyme activity in the lysates from infected Hela cells 
and also assayed for total protein. In Fig. 5A, the test 
condition is shown along the horizontal axis, and 

15 intracellular B-galactosidase specific activity 

(milliunits/mg protein) using ONPG as substrate is 
plotted on the vertical axis. Below each bar, the 
fold- induction in specific activity relative to cells 
that received the AV.CMV-Lac^ vector alone is given. 

20 The results of Fig. 5A demonstrate that the E2a 

mutants tsl25 and dl802 produced 134-fold and 225-fold 
increases in B-galactosidase activity, respectively, as 
compared to that achieved with purified rAAV alone. In 
comparison, cells infected with wt AdS generated 107-fold 

25 increase in B-galactosidase activity. 

(B) In another experiment, HeLa cells (2xl0 6 ) were 
infected with increasing multiplicities of wild-type AdS 
or the E2 mutant dl802. Six hours post-infection, 
monolayers were washed and infected with AV.CMVIracZ at 

30 1000 virus particles/ cell. Twenty-four hours after the 
addition of AV.CMVLacZ , cells were harvested and assayed 
for total protein and B-galactosidase activity. 

The results are illustrated in the bar graph of 
Fig. 5B, in which adenovirus MOI's are given along the 

35 horizontal axis, and intrac llular B-galactosidase 
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specific activity along the vertical axis. Enhancement 
of rAAV transduction was proportional to input helper 
adenovirus from MO Is of 1 to 50 for both wild type Ad5 
and dl802. Higher doses of virus were cytopathic, 
5 leading to a fall in B-galactosidase expression. 

Enhanced transduction was achieved when the cells were 
infected prior to, or at the time of , rAAV infection. 
The El deletion recombinant H5.CBAU> and the E4 deletion 
mutant dll004 provided no significant increase in 

10 KV.CXWLacZ transduction. Both cells infected with dlllO 
(ElB~55kDa) and with dllOlO (ORF6) demonstrated 
substantially lower total B-galactosidase activity than 
those infected with Ad5, tsl25, or dl802. 
Example 12 - Analysis of Low Molecular Weight DNAs in 

15 AV.CMVIiacg Transduced Cells 

Studies with these early gene mutants of adenovirus 
suggested that expression of adenoviral genes rather than 
the virion itself was responsible for enhancement of rAAV 
transduction. To further investigate these mechanisms 

20 and to determine if conversion of ss to ds genome limits , 
the transduction efficiency of rAAV, the molecular state : 
of the rAAV genome was characterized in the infected ^ 
cells. The relationship between RFm formation and lacZ 
rAAV transduction was explored in experiments where the 

25 dose of coinfecting virus was varied (MOI=l, 5, or 10). 
(A) A duplicate set of HeLa monolayers as 
described in Example 10 were harvested 24 hours after 
they were transduced with the recombinant AV.CMVXacZ and 
cultured with or without helper adenovirus. 

30 Episomal DNA was extracted from cell pellets 

using a modification of the procedure originally 
described by B. Hirt, J. Mol. Biol. , £6:365-369 (1967). 
Briefly, cells were suspended in 320 ml Tris-Cl 
(pH8.0)/10 mM EDTA and SDS added to a final concentration 

35 of 1%. The mixture was incubated at 37 °C for 30 minutes. 
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Pronase and proteinas K ver added to final 
concentrations of 500 /*g/ml and 20 pg/ml, respectively, 
and the reaction incubated at 37 °C for 2 hours. Sodium 
chloride was added to a final concentration of 1.1 M and 
5 incubated at 4°C overnight. The precipitate that 
developed during the 4°C incubation was pelleted at 
20,000 xg for 30 minutes and the clear supernatant 
carefully removed. The supernatant was extracted once 
with phenol : chloroform : isoamyl alcohol (25:24:1) followed 

10 by chloroform: isoamyl alcohol (24:1). Nucleic acids were 
precipitated with ethanol. The final pellet was 
suspended in 50 /jtl Tris-Cl (pH 8.0) /1 .0 mM EDTA. 

These Hirt extracts were analyzed by Southern 
blot hybridization. Samples (5 Ml) of each Hirt extract 

15 were resolved through a 1.2% agarose gel, electroblotted 
onto a nylon membrane and hybridized with a 32 P random 
primer labeled cDNA of the SV40 polyA signal used in 
AV.CMVLacZ. 

An autoradiogram of the experiment of Example 

20 12 (not pictured) , identifies and labels bands 

corresponding to the ss AV.CMViacZ genome (SS) , a monomer 
replicative form (RFm) , and concatomer replicative forms 
(RFd) • Bands corresponding to the ss AV .CMVLacZ genome 
(SS) , a monomer replicative form (RFm) , and concatomer 

25 replicative forms (RFd) were identified and labeled. To 
reference the RFm band, a plasmid carrying AV.CMVLacZ was 
digested to release the entire genome. Autoradiogram 
exposure times were 14 hours and 69 hours. 

In this autoradiogram, the full spectrum of 

30 molecular species present during a lytic infection was 
demonstrated in cells infected with both LacZ rAAV and 
wild type adenovirus. Both the input ss genome (SS) and 
monomeric and dimeric forms of ds replicative 
intermediates (RFm and RFd) are present. This contrasts 

35 with cells infected with purified rAAV alone, where ss 
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genome is the sole molecular form detected. Analysis of 
cells coinfected with the adenovirus early gene mutants 
revealed a direct correlation between formation of ds 
forms of the rAAV genome and the enhancement of LacZ 
5 transduction. Mutant adenoviruses that were ineffective 
in enhancing rAAV transduction (i.e., the El deleted 
mutant H5.CBALP and the E4 deleted mutant dll004) failed 
to promote the formation of ds forms of AAV* 

Cells infected with adenovirus deleted of E2a 

10 (dl802) or partially deleted of El (dlllO) or E4 (dllOlO) 
additionally demonstrated a band whose size was identical 
to the ds replicative monomer (RFm) of the lacZ rAAV 
genome and whose abundance correlated directly with the 
expression of B-galactosidase activity (compare results 

15 of Example 14 to these described results) . Slower 

migrating concatomers, likely dimers, of duplex rAAV were, 
also detected in the autoradiogram described above. 

In the presence of El and E4 expression, rAd 
transduction was invariably accompanied by the appearance, 

20 of ds RF monomers and dimers. 

The high molecular weight band in sample lane 
+H5.CBALP is helper virus DNA. Helper virus DNA is 
recognized by the SV40 probe because the CBALP minigene 
also utilizes the SV40 polyA signal. 

25 (B) In another experiment, HeLa cells were infected 

with wt Ad5 or the E2 deleted mutant dl802 as described 
in Example 10B. Monolayers were harvested 24 hours later 
and analyzed for B-galactosidase activity and RFm 
synthesis • Monomer bands similar to those shown in the 

30 autoradiogram described above were quantitated on a 
Phosphor Imager 44 5 SI and assigned values (CPM) . 

The results are illustrated in the graphs of 
Figs. 8A and 8B, in which B-galactosidase specific 
activity and CPM are plotted along the vertical axis of 

35 each figure. Adenovirus MOI's are given on the horizontal 
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axis of each figure. Data obtained from low MOI 
infections (1, 5, and 10) are shown. Importantly, the 
tight correlation between rAAV vector transduction and 
the accumulation of duplex forms could be achieved in 
5 cells infected with E1/E4 expressing adenovirus. The 
level of B-galactosidase and abundance of RFm increased 
in proportion to the amount of infecting wild type Ad 
(Fig. 6A) and dl802 (Fig 6B) . These data suggest that 
synthesis of an episomal duplex intermediate is an 

10 obligatory event in transduction. 
Example 13 - Duplex End-Analvsis 

The following is a description of a model for 
leading strand synthesis of a complementary AAV strand in 
the presence of Rep (+Rep) or absence of Rep (-Rep) . 

15 Refer to Figs. 7A-7F. Rep expresses a terminal 

resolution activity that can convert a duplex structure 
with closed-ends to an open-ended duplex. In the absence 
of Rep, terminal resolution is impaired leaving the 
covalently closed, hairpin structures intact. Under 

20 these conditions, hairpins are expected to be found 

leftward and rightward, since both strands of a rescued 
ds AAV genome are packaged into virions. Figs. 7B-7F are 
a flow chart demonstrating the strategy for identifying 
the terminal structure of duplex RFm that is synthesized 

25 from ss AV.CMVLacZ in response to adenoviral gene 
expression. 

Fig. 7C illustrates a closed end and an open end 
fragment of rAV.CMVLacZ. Figs. 7D, 7E and 7F indicate 
the mixture of open-ended and covalently closed duplex 

30 fragments generated by NotI digestion at position 4509 in 
the absence of terminal resolution. The NotI 4509 
digestion provides a convenient means of releasing a 361 
bp fragment that contains the right ITR in the context of 
a hybridization target (i.e. SV40 pA) . In the presence 

35 of terminal resolution, only the open-ended 361 bp 
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fragment would be expected to be generated (Fig. 7D) by 
such digestion. 

The resulting electrophoretic gel (not pictured) , 
revealed in lane (1) the results of digestion of a 
5 plasmid carrying an AV.CMViac^ cDNA to release the rAAV 
vector. and subsequent digestion with NotI to release the 
right terminal 361 bp fragment. In lane (2) a sample of 
NotI digested Hirt DNA extracted from HeLa cells infected 
with wild- type Ad5 and transduced with AV.CMV.LacZ 

10 resulted in the release of two fragments, labeled Form! 
and Formll. (See, also, Figs. 8A and 8B) . The migration 
of ss AV.CMVLacZ (SS) and RFm were also seen. 

The ds AV.CMVLacZ intermediates that accumulated in 
cells infected with adenovirus were likely the result of 

15 leading strand DNA synthesis, initiating from the duplex 
region of the vector ITR. In the absence of Rep, this 
conversion event was anticipated to generate molecules in 
which one end is open and the other is covalently closed, 
(Fig. 7A) . To further characterize the structure of this 

20 ds intermediate Hirt extracts from cells coinfected with- 
rAV.CMRLacZ and Ad5 were digested with NotI to release - 
the termini of the ds intermediate which, if left open, _ 
would be approximately 361 bp in length. The resulting 
filters were hybridized with a probe specific for the 

25 SV40 polyadenylation signal positioned immediately 

upstream of the rightward ITR. At least two forms were 
released from the right end of duplex genomes, one that 
migrated to a position in the gel that predicted an 
open-ended conformation (Form II) , and a second slower 

30 migrating species (Form I) . Although this result was 

consistent with the model (Figs. 7A-7F) , it was difficult 
to predict with certainty the structure of Form I. Its 
retarded mobility did, however, suggest a conformation 
that differed from the open-ended Form II. 
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Example 14 - Analysis of AV.CMVLacZ Transduction 
Efficiency in 293 Cells Stably Transfected with an 
Inducible E4 ORF6 cDNA 

Cell lines used in this assay were prepared as 
5 described in Example l. 293(MT-ORF6) cells and HeLa(MT- 
ORF6) cells were seeded in 6 well 35 mm culture plates 
(2xl0 6 cells/ well) and infected with purified, heat- 
treated AV.CMViacZ at an MOI of 1000 virus 
particles/cell. Induction of ORF6 expression with from 

10 none to increasing concentrations of zinc sulfate was 
initiated 2 hours before the addition of virus and 
continued throughout the duration of the experiment. 

Twenty-four hours after the addition of virus, cells 
were harvested, lysates were generated by sonication and 

15 analyzed for the 6-galactosidase expression (i.e., 

B-galactosidase activity) and virus DNA as described in 
the preceding examples. Hirt extracts were prepared from 
low molecular weight DNA from cell extracts. The 
preparation of the Hirt extracts and subsequent analysis 

20 by Southern hybridization were performed similarly to 
those described in the examples above. 

The results of this experiment were as follows: 
(1) Specific Activity 

The results are illustrated in the bar 

25 graph of Fig. 8A. Specific activity (milliunits 

fi-galactosidase/mg protein) is plotted along the vertical 
axis. Below each bar is given the concentration of zinc 
used for induction, the fold-induction relative to 293 
cells, and the fold-induction relative to 293(ORF6) cells 

30 maintained in the absence of zinc. As shown in Fig. 8A, 
in the absence of Zn+2 , the 293(MT-ORF6) cell line 
generated 3 9-fold higher levels of B-galactosidase in 
rAAV infected 293 cells. Induction of ORF6 expression 
with increasing amounts of Zn +2 resulted in a concomitant 

35 rise in XV .CMVLacZ cell transduction to a level that was 
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445-fold greater than th parent 293 line. Expression of 
El alone was insufficient to have an effect in the 
adenovirus mediated augmentation of rAAV transduction. 

The specific activity of B-galactosidase 
5 was 196.2 mUnits/mg in E1/E4 expressing 293 cells, 

compared to 1.0 mUnit/mg in 293 cells that only expressed 
El genes. These experiments support a mechanism for 
enhancing rAAV transduction that is dependent on the 
combined expression of both El and E4 adenoviral genes. 

10 (2) Molecular Analysis of the AV.CMVLacZ 

Genome 

The duplex monomer replicative form (RFm) 
was quant ita ted and the values (CPM) plotted along the 
vertical axis in the bar graph of Fig. 8B. The 

15 concentration of zinc used for induction and the 

fold- induction relative to 293(ORF6) cells maintained in 
0 mH zinc is given below each bar. 

An autoradiogram (not pictured) shows the 
agarose gel resolved Hirt extracts from the AV.CMVLacZ 

20 transduced cells described above. A plasmid carrying the 
AV.CMViacZ cDNA was digested to release the entire 
sequence and loaded in a lane of the autoradiogram. The 1 : 
band that appeared in this lane therefore reflected the 
migration of a monomer duplex replicative form (RFm) . 

25 The migration of the ss AV.CMVLacZ genome (SS) , RFm, and 
dimers of the duplex replicative form (RFd) were also 
shown. Lanes of the autoradiogram labeled (0) , (50) , 
(100), (150), (200), and (250) contained samples from 
293(MT-ORF6) cells that were induced with the indicated 

30 concentration of zinc. A Hirt extract from 293 cells 
(lane labeled 293) transduced with AV.CMVLacZ was also 
shown. 

Analysis of Hirt extracts revealed the 
presence of the RFm in the rAAV infected 293(MT-ORF6) 
35 cells that was not present in similarly infected 293 
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cells. When the induction profiles (Figs. 8A and 8B) 
that describe AV.CMViacZ transduction efficiency were 
compared, the results were plotted in Fig. 8C. Specific 
activity (milliunits 6-galactosidase/mg protein) data 
5 from Fig. 8A and counts -per -minute data (CPM) of 
AV.CUVLacZ RFm from Fig. 8B are plotted along the 
vertical axis, and concentration of zinc sulfate used 
during the experiment is shown along the horizontal axis. 

The two profiles are near mirror images. 

10 Importantly, the RFm increased in proportion to the 

increment in lacZ transducing activity that occurred as 
ORF-6 expression was induced with Zn +2 (Fig. 8C) . Similar 
results were obtained with a 293 derived cell line that 
expresses ORF6 from the glucocorticoid responsive MMTV 

15 promoter. 

Example 15 - Enhanced AV.CMVLacZ Transduction in HeLa 
Cells Carrying an Inducible ORF6 Miniaene 

HeLa (MT-0RF6) cells (2xl0 6 ) were transduced at an 
MOI of 1,000 AV.CMVLacZ recombinant particles/cell in 

20 absence of zinc sulfate inducer or in the presence of 50, 
100, 150, 200, or 250 fM zinc sulfate inducer in the 
media during transduction. Twenty-four hours later, 
cells were harvested, cell extracts were prepared by 
sonication, and analyzed for transgene expression (i.e., 

25 B-galactosidase activity) . Cell monolayers were 

histochemically stained for B-galactosidase activity. 

The resulting photomicrographs (not pictured) 
illustrated that histochemical staining revealed an 
increase in the number of cells scored la.cZ positive as 

30 the concentration of Zn +2 in the medium was raised from 0 
to 200 mM. Concentrations of 250 mM zinc were found to 
be toxic to the cells. 

Specific activity (milliunits fi-galactosidase/mg 
protein) is plotted in Fig. 9 along the vertical axis. 

35 Below each bar is given the concentration of zinc used 
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for induction, the fold- induction relativ to HeLa cells, 
and the fold-induction relative to HeLa (Mt-ORF6) cells 
maintained in the absence of zinc. Histochemical 
staining revealed an increase in the amount of 
5 S-galactosidase in lysates as the concentration of Zn +2 
in the medium was raised from 0 to 200 mM. 
Example 16 - Southern Blot Analysis of Low Molecular 
Weight DNAs from AV.CMVIracZ Transduced HeLa (MT— ORF6 ) 
Cells 

10 Fol loving Induction of E4QRF6 

Hirt extracts were prepared from HeLa (MT— 0RF6 ) cells 
transduced with KV.CMHLacZ as described in Example 15 in 
the presence of increasing concentrations of Zn +2 to 
determine whether synthesis of duplex intermediates 

15 contributed to the augmentation in AV.CMVLacZ 
transduction. 

Samples of HeLa (MT-0RF6 ) cells that were induced 
with a concentration of zinc sulfate (0, 50, 100, 150, 
200, and 250) were resolved on a 1.2% agarose Southern 

20 gel (not pictured) , transferred to a nylon membrane, and 
hybridized with a Lac2-specif ic probe. One lane 
contained a plasmid encoding AV.CMVLac^ that was digested 
to release the entire genome. Bands corresponding to the 
ss AV.CMVLac^ genome (SS) , duplex monomers (RFm) , and 

25 duplex dimers (RFd) were indicated on the gel. 

Southern analysis indicated that Hela and uninduced 
Hela (MT— 0RF6 ) cells demonstrated a single band on 
Southern blots which comigrated with the ss genome. 
Induction of ORF-6 resulted in the appearance of 

30 detectable levels of ds monomer but only at higher 

concentrations of Zn+2. A band comigrating with the RFd 
was present in all cell preparations, the relevance of 
which is unclear since the monomer is a likely precursor 
to the dimer. 
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Example 17 - Effect of Adenovirus Infection on In Vivo 
AV.CMVXacg Targeti ng Efficiency To Murine Liver 

The impact of adenoviral gene expression on rAAV 
transduction in murine liver was studied by sequentially 
5 infusing into the portal vein early gene mutants of 
adenovirus followed by rAAV. 

Balb/c mice, 4- to 6-weeks old [Jackson 
Laboratories, Bar Harbor, Maine] were anesthetized by an 
intraperitoneal injection of ketamine (70 mg/kg) and 

10 xylazine (10 mg/kg). For liver studies, a 1 cm left 
flank incision was made and the spleen exposed. 

Samples of purified, heat-treated AV.CMVLetcZ in 50 
Ml HBS (lxl0 u virus particles) were used alone or spiked 
with helper adenovirus containing 2xl0 10 A^o particles of 

15 purified dll004, H5.CBALP, or ts!25 in a final volume of 
50 /il- The dose of adenovirus was sufficient to 
transduce >25% of hepatocytes. The virus mixture was 
injected just beneath the splenic capsule and the abdomen 
was closed with 3-0 vicryl. 

20 Necropsies were performed 3 days post-infusion and 

tissue frozen in O.C.T. embedding compound. Frozen 
sections (6/xm) (LacZ+ALP) were prepared and 
histochemically stained for B-galactosidase enzyme and 
alkaline phosphatase activity. Sections were 

25 counter stained with neutral red and mounted. 

A B-galactosidase positive hepatocyte targeted with 
AV.CMVLac^ at magnification 20X was obtained. 
Histochemical analyses of liver tissue harvested 3 days 
after gene transfer demonstrated that administration of 

30 io u particles of purified rAV.CMVX,ac2T alone into the 
portal vein was not associated with appreciable gene 
transfer (<0.01% of cells), confirming the inherent 
inefficiency of the rAAV system. 

Preinfusion with E4 deleted virus had no impact on 

35 rAAV transduction in mouse liver, whereas El deleted 
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virus demonstrated a modest increment in lacZ positive 
hepatocytes to about 0.1%. The most significant increase 
in rAAV transduction occurred following infusion of the 
E2a adenovirus mutant tsl25 with la.cZ expression detected 
5 in 10-25% of hepatocytes. A direct relationship between 
adenovirus gene expression and rAAV transduction was 
demonstrated in animals infused with both lacZ rAAV and 
the ALP expressing El deleted virus. The dose of 
adenovirus was reduced 10-fold to minimize the 
10 coincidental occurrence of coinf ection. Histochemical 
studies demonstrated co-localization of ALP and 3- 
galactosidase in the majority of B-galactosidase 
expressing hepatocytes. 

Example 18 - Effect of Adenovirus Infection on In Vivo 

15 AV-CMVLacff Targeting Efficiency To Murine Luna 

Experiments described in Example 17 for mouse liver 
were adapted for the study of rAAV mediated gene transfer 
to mouse lung. For lung experiments, anesthetized Balb/C 
animals were intubated as described in DeMatteo et al, 

20 Transplantation (Baltimore^ . 59 (5) : 787-789 (1995). 

Briefly, a midline 2 cm skin incision was made in the 
neck to expose the trachea. A 2 inch 18 gauge 
angiocatheter was passed through the mouth, positioned in 
the midportion of the trachea, and connected to a rodent 

25 ventilator (#55-3438 Harvard). Polyethylene (PE#10, 

Intramedic) was fed through the catheter via a side port 
and advanced beyond the tracheal bifurcation. Using a 
Hamilton syringe, virus samples (30/ul) were slowly 
infused into the lung through the polyethylene tubing. 

30 Samples contained the same formulation of purified, heat- 
treated AV.CMVLacZ with or without helper adenovirus, as 
described for liver injections. 

Tissue was harvested 72 hours post-infusion. Frozen 
sections wer histochemically stained for B-galactosidase 

35 activity and counter stained with neutral red. 
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Frozen sections from lung (AV. CMHLacZ) shoved a B- 
galactosidase positive airway epithelial cell targeted 
with XV.CMVLacZ. similar studies were performed in the 
murine model of lung-directed gene transfer. 
5 Adenoviruses were instilled into the trachea prior to the 
instillation of rAAV. Analysis of lung tissue 3 days 
later revealed only a rare 6-galactosidase positive cell 
in animals instilled with rAAV alone. No detectable 
enhancement of rAAV transduction was noted in animals 

10 preinstilled with adenovirus deleted of either El or E4. 
Substantial enhancement of transduction was achieved in 
conducting airway and alveolar cells of animals 
administered the E2a mutant adenovirus. 

These experiments in murine models of gene therapy 

15 directed to liver and lung verified that the efficiency 

of rAAV transduction is low due limited conversion of the 
input ss genome to a transcriptionally active ds 
intermediate, and that this conversion is facilitated by 
expression of adenovirus El and E4 gene products. 

20 Example 19 - Second Generation rAAV with Regulated 
Miniaene Capable of Enhancing Transduction 

The experiments described in previous examples 
illustrated the following principles: 1) purified rAAV is 
a relatively inefficient gene transfer vehicle in vitro 

25 and in vivo and 2) the rate limiting step in transduction 
is not viral entry but rather conversion of the virion § s 
ss DNA genome to a transcriptionally active ds DNA 
genome. Adenovirus can substantially enhance 
transduction through expression of a subset of its genes. 

30 It does this by promoting conversion of the virion's 

genome to its ds form. One approach to accomplish this 
is to incorporate into the recombinant AAV genome a 
minigene that expresses the minimal adenoviral genes 
necessary to enhance transduction, i.e., the 0RF6 region 

35 of E4. 
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Two approaches have been considered in designing 
this modified rAAV. The first strategy is based on a 
rAAV genome that has two transcriptional units in series, 
one expressing the therapeutic gene and the other 
5 expressing its E4 0RF6 from a constitutive promoter. 
While this may, in fact, be useful in many situations, 
constitutive expression of ORF6 may be detrimental to the 
cell and potentially could elicit a destructive immune 
response* 

10 The second version of this rAAV includes the 

therapeutic minigene in addition to the ORF6 
transcriptional unit which, in this case, is expressed 
from an inducible promoter. When this second gene rAAV 
is administered to the cells (ex vivo strategies) or to 

15 the patient (in vivo strategies) , the inducing agents are 
administered at the time of gene transfer or soon 
thereafter. If the ds genomic form or its integrated 
derivative is stable, the induction of ORF6 will only be 
necessary at the time of gene transfer into the recipient 

20 cell. Following this, its inducing agent will be 
withdrawn and the ORF6 gene will be turned off. 

An rAAV that illustrates this concept of inducible 
ORF6 has been constructed and tested in vitro. A 
schematic of the vector pAV.CMVALP.GRE-ORF6, is shown in 

25 Fig. 11 and its sequence is illustrated in SEQ ID NO: 5. 
This second generation construct contains flanking 5 1 and 
3* AAV ITR sequences. The human placental alkaline 
phosphatase cDNA (ALP) is included in a minigene in which 
the promoter from the immediate early gene of 

30 cytomegalovirus drives the transcription. A second 

transcriptional unit is cloned between the ITRs in series 
and in direct orientation with the alkaline phosphatase 
minigene. The second transcriptional unit expresses the 
Ad5-E4-0RF6 from a glucocorticoid depend nt promoter 

35 (GRE) with an SV4 0 polyadenylation signal. This is 
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called a s cond generation rAAV construct. 

Specifically, pAV.CMVALP.GRE-ORF6 [SEQ ID NO: 5] 
generates a novel rAAV containing the LacZ transgene and 
the Ad E4 ORF 6 which facilitates ss to ds conversion of 
5 rAAV. The plasmid includes a flanking AAV 5 1 ITR 

sequence (nucs. 53-219) ; CMV enhancer /promoter (nucs. 
255-848); human placenta alkaline phosphatase cDNA (ALP) 
(nucs. 914-2892); SV40 polyA (nucs. 2893-3090); GRE 
promoter (nucs. 3114-3393); Ad5 E4-ORF6 cDNA (nucs. 3402- 
10 4286); SV40 polyA (nucs. 4315-4512); and 3 1 AAV ITR 

(nucs. 4547-4713). All other nucleotides are plasmid 
derived. 

The second generation rAAV construct was used to 
produce and purify rAAV virions which were exposed to 

15 HeLa cells that were left untreated or incubated with 
dexamethasone • In the absence of dexamethasone , (a 
condition under which little ORF6 should be expressed), 
little transduction was observed as measured by 
expression of the alkaline phosphatase gene. Cells 

20 incubated in dexamethasone expressed in ORF6 gene and the 
transduction efficacy was enhanced at least 5-fold. This 
provides evidence to support that a gene product 
expressed from the rAAV can function in cis to enhance 
expression of the transgene. 

25 Example 20 - Application to Bone Marrow Directed Gene 
Therapy 

Bone marrow directed gene therapy represents the 
paradigm of ex vivo gene therapy where the target cell is 
the hematopoietic stem cell. The basic strategy is to 

30 incorporate (i.e., integrate) a therapeutic minigene into 
the chromosomal DNA of hematopoietic stem cells which are 
transplanted into a recipient patient whose own bone 
marrow has been ablated allowing repopulation of its 
lymphohematopoietic system with progeny of the 

35 genetically corrected stem cell. 
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The problem with this approach has been efficiently 
transfecting genes into stem cells. Most studies of bone 
marrow directed gene therapy have utilized recombinant 
retroviruses which have not been very efficient. One 
5 problem is that retroviruses integrate their provirus 
only when the target cell is dividing. Unf ortunately , 
most stem cells in vitro are quiescent and not dividing. 
rAAV holds the promise of integrating the provirus more 
efficiently into non-dividing stem cells. However, 

10 purified rAAV is not very efficient with respect to 
integration when used alone. In cultured cells, 
integration is observed in less than 1% of the cells. 
The same conditions that activate the conversion of ss to 
ds genome also enhance the integration of the ds 

15 intermediate into the chromosomal DNA. 

Therefore, a desirable application of the methods 
and compositions of this invention is in bone marrow 
directed gene therapy. According to this method, stem 
cells are genetically modified with rAAV and an inducing 

20 agent ex vivo using the constructs and methods described 
above (see e.g., Example 19). Genetically modified stem:, 
cells are subsequently transplanted by conventional 
techniques . 

Numerous modifications and variations of the present 
25 invention are included in the above- identified 

specification and are expected to be obvious to one of 
skill in the art. Such modifications and alterations to 
the compositions and processes of the present invention, 
such as selections of different transgenes and plasmids 
3 0 for the construction of the packaging cell lines and 
rAds, or selection or dosage of the viruses or immune 
modulators, are believed to be encompassed in the scope 
of the claims appended hereto. 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3653 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1521.. 2405 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



CTGCATGTGT 


CAGAGGTTTT 


CACCGTCATC 


ACCGAAACGC 


GCGAGGCAGC 


50 


AAGCTTGGCA 


GAAATGGTTG 


AACTCCCGAG 


AGTGTCCTAC 


ACCTAGGGGA 


100 


GAAGCAGCCA 


AGGGGTTGTT 


TCCCACCAAG 


GACGACCCGT 


CTGCGCACAA 


150 


ACGGATGAGC 


C CAT C AG AC A 


AAGACATATT 


CATTCTCTGC 


TGCAAACTTG 


200 


GCATAGCTCT 


GCTTTGCCTG 


^» ^ /"«rT1 TV fill II/™*/"* 

GGGCTATTGG 


GGGAAGTTGC 


GG 1 1 v-G 1 GC X 


1 n 


CGCAGGGCTC 


TCACCCTTGA 


CTCTTTCAAT 


AATAACTCTT 


CTGTGCAAGA 


300 


TTACAATCTA 


AACAATTCGG 


AGAACTCGAC 


CTTCCTCCTG 


AGGCAAGGAC 


350 


CACAGCCAAC 


TTCCTCTTAC 


AAGCCGCATC 


GATTTTGTCC 


TTCAGAAATA 


400 


GAAATAAGAA 


TGCTTGCTAA 


AAATTATATT 


TTTACCAATA 


AGACCAATCC 


450 


AATAGGTAGA 


TTATTAGTTA 


CTATGTTAAG 


AAATGAATCA 


TTATCTTTTA 


500 


GTACTATTTT 


TACTCAAATT 


CAGAAGTTAG 


AAATGGGAAT 


AGAAAATAGA 


550 


AAGAGACGCT 


CAACCTCAAT 


TGAAGAACAG 


GTGCAAGGAC . 


TATTGACCAC 


600 


AGGCCTAGAA 


GTAAAAAAGG 


GAAAAAAGAG 


TGTTTTTGTC 


AAAATAGGAG 


650 


ACAGGTGGTG 


GCAACCAGGG 


ACTTATAGGG 


GACCTTACAT 


CTACAGACCA 


700 


ACAGATGCCC 


CCTTACCATA 


TACAGGAAGA 


TATGACTTAA 


ATTGGGATAG 


750 


GTGGGTTACA 


GTCAATGGCT 


ATAAAGTGTT 


ATATAGATCC 


CTCCCCTTTC 


800 


GTGAAAGACT 


CGCCAGAGCT 


AGACCTCCTT 


GGTGTATGTT 


GTCTCAAGAA 


850 


AAGAAAGACG 


ACATGAAACA 


ACAGGTACAT 


GATTATATTT 


ATCTAGGAAC 


900 
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AGGAATGCAC TTTTGGGGAA 
CTGGACTAAT AGAACATTAT 
GATTAGCCTT GATTTGCCCA 
TTTGGTTACA AACTGTTCTT 
GACTTGGTTT GGTATCAAAG 
TCCTATGTTC TTTTGGAATT 
ACCAAGATAT AAAAGAGTGC 
AACATTCACC TCTTGTGTGT 
GTCACTTATC CTTCACTTTC 
CTCAGGTCGG CCGACTGCGG 
AAGTGACCCT TGTCTTTATT 
TCTATCTTGT CTGGCTATCA 
AACCAAGCTA GCCCAATTCG 
ACACTACGAC CAACACGATC 
GGATCGTCTA CCTCCTTTTG 
ATCATCCGCT GCTGCCCGAA 
TACGTGCGAG GTCTTCCCTG 
GGTTGTTCCC TGGGATATGG 
GGAAGTGTAT GCACGTGTGC 
AGCATGATGA TCCATGGTTA 
CAGTCCCGGT TCCCTGCAGT 
GGTTTAGGAT GGTGGTGGAT 
TACCGGGAGG TGGTGAATTA 
CAGCGTGTTT ATGAGGGGTC 
ATGGCCACGT GGGTTCTGTG 
TTGCACTGTG GGATTTTGAA 



70 

AGATTTTCCA TACCAAGGAG GGGACAGTGG 950 

TCTGCAAAAA CTTATGGCAT GAGTTATTAT 1000 

ACCTTGCGGT TCCCAAGGCT TAAGTAAGTT 1050 

AAAACAAGGA TGTGAGACAA GTGGTTTCCT 1100 

GTTCTGATCT GAGCTCTGAG TGTTCTATTT 1150 

TATCCAAATC TTATGTAAAT GCTTATGTAA 1200 

TGATTTTTTG AGTAAACTTG CAACAGTCCT 1250 

TTGTGTCTGT TCGCCATCCC GTCTCCGCTC 13 00 

CAGAGGGTCC CCCCGCAGAC CCCGGCGACC 1350 

CAGCTGGCGC CCGAACAGGG ACCCTCGGAT 1400 

TCTACTATTT TGTGTTCGTC TTGTTTTGTC 1450 

TCACAAGAGC GGAACGGACT CACCTCAGGG 1500 

ATGACTACGT CCGGCGTTCC ATTTGGCATG 1550 

TCGGTTGTCT CGGCGCACTC CGTACAGTAG 1600 

AGACAGAAAC CCGCGCTACC ATACTGGAGG 1650 

TGTAACACTT TGACAATGCA CAACGTGAGT 1700 

CAGTGTGGGA TTTACGCTGA TTCAGGAATG 1750 

TTCTAACGCG GGAGGAGCTT GTAATCCTGA 1800 

CTGTGTTGTG CCAACATTGA TATCATGACG 1850 

CGAGTCCTGG GCTCTCCACT GTCATTGTTC 1900 

GTATAGCCGG CGGGCAGGTT TTGGCCAGCT 1950 

GGCGCCATGT TTAATCAGAG GTTTATATGG 2 000 

CAACATGCCA AAAGAGGTAA TGTTTATGTC 2 050 

GCCACTTAAT CTACCTGCGC TTGTGGTATG 2100 

GTCCCCGCCA TGAGCTTTGG ATACAGCGCC 2150 

CAATATTGTG GTGCTGTGCT GCAGTTACTG 2200 
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TGCTGATTTA AGTGAGATCA GGGTGCGCTG CTGTGCCCGG AGGACAAGGC 2250 

GCCTTATGCT GCGGGCGGTG CGAATCATCG CTGAGGAGAC CACTGCCATG 2300 

TTGTATTCCT GCAGGACGGA GCGGCGGCGG CAGCAGTTTA TTCGCGCGCT 23 50 

GCTGCAGCAC CACCGCCCTA TCCTGATGCA CGATTATGAC TCTACCCCCA 24 00 

TGTAGGGATC CAAGCTTGCG GGCGCATCGA TGATATCAAG CTTGCATGCC 2450 

TGCAGGTCGA CTCTAGAGGA TCCCGGGTGG NATCCCTGTG ACCCCTCCCC 2500 

AGTGCCTCTC CTGGCCCTGG AAGTTGGCAC TCCAGTGCCC ACCAGCCTTG 2550 

TCCTAATAAA" ATTAAGTTGN ATCATTTTGT CTGACTAGGT GTCCTTCTAT 2600 

AATATTATGG GGTGGAGGGG GGTGGTATGG AGCAANGGGN AANTTGGNAA 2 650 

GACAANCTGT AGGGCCTGCG GGGTCTATTG GGAACAAGCT GGAGTGCAGT 2700 t 

GGCACAATCT TGGCTCACTG CAATCTCCGC CTCCTGGGTT CAAGCGATTC 2750~ 

TCCTGCCTCA GACTCCCGAG TTGTTGGGAT TCCAGGCATG CATGACCAGG 2800 

CTCAGATAAT TTTTGTTTTT TTGGTAGAGA CGGGGTTTCA CCATATTGGN 2850 

CAGGCTGGTC TCCAACTCCT AATCTCAGGT GATCTNCCCA CCTTGGCCTC 290 Ot 

CCAAATTGCT GGGATTACAG GNGTGAACCA CTGNTCCCTT CCCTGTCCTT 2950 

CTGATTTTAA AATAACTATA CCAGCAGGAG GACGTCCAGA CACAGCATAG 3 000 

GCTACCTGGC CATGCCCAAC CGGTGGGACA TTTGAGTTGC TTGCTTGGCA 3050 

CTGTCCTCTC ATGCGTTGGG TCCACTCAGT AGATGCCTGT TGAATTGGGT 3100 

ACGCGGCCAG CTTGGCTGTG GAATGTGTGT CAGTTAGGGT GTGGAAAGTC 3150 

CCCAGGCTCC CCAGCAGGCA GAAGTATGCA AAGCATGCAT CTCAATTAGT 3200 

CAGCAACCAG GTGTGGAAAG TCCCCAGGCT CCCCAGCAGG CAGAAGTATG 3250 

CAAAGCATGC ATCTCAATTA GTCAGNAACC ATAGNCCCGC CCCTAACTCC 33 00 

GTCCATCCCG GCCCTAACTC NGGCCAGTTC CGACCNTNCT CCGGCNNATG 3 3 50 

GNTGAGTAAT TTGCNNGATT TATGCAGNGG GCGAGGNCGC CTCGGGCTCT 34 00 

GAGNTNTTCC AGAAGTAGTG AGGAGGCTTT NNTGGTGGAA TTGATCAGCT 3450 

TGGGATCTGA TCAAGAGACA GGATGAGGAT CGNNNCGNAT GATTGAACAA 3500 
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GATGGGTTGC ACGGAGGTTC TCCGGNCGCT TGGGTGGGGA GGNTATTCGG 3 550 

NTATTNTTGG TGNACAACAG NNAAACGGNT GTTCTGATGC CGCCGCGTTC 3 600 

NCGCTTTCAG NGCAGGGGGG CCCCCCTTCT NTTGAGANNA GCNCCCCTTN 3 650 

TTG 3 653 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 294 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Thr Thr Ser Gly Val Pro Phe Gly Met Thr Leu Arg Pro Thr Arg 
1 5 10 15 

Ser Arg Leu Ser Arg Arg Thr Pro Tyr Ser Arg Asp Arg Leu Pro Pro 
20 25 30 

Phe Glu Thr Glu Thr Arg Ala Thr lie Leu Glu Asp His Pro Leu Leu 
35 40 45 

Pro Glu Cys Asn Thr Leu Thr Met His Asn Val Ser Tyr Val Arg Gly 
50 55 60 

Leu Pro Cys Ser Val Gly Phe Thr Leu lie Gin Glu Trp Val Val Pro 
65 70 75 80 

Trp Asp Met Val Leu Thr Arg Glu Glu Leu Val lie Leu Arg Lys Cys 

85 90 95 

Met His Val Cys Leu Cys Cys Ala Asn lie Asp lie Met Thr Ser Met 
100 105 110 

Met lie Tyr Gly Tyr Glu Ser Trp Ala Leu His Cys His Cys Ser Ser 
115 120 125 

Pro Gly Ser Leu Gin Cys lie Ala Gly Gly Gin Val Leu Ala Ser Trp 
130 135 140 

Phe Arg Met Val Val Asp Gly Ala Met Phe Asn Gin Arg Phe lie Trp 
145 150 155 160 
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Tyr Arg Glu Val Val Asn Tyr Asn Met Pro Lys Glu Val Met Phe Met 

165 170 175 

Ser Ser Val Phe Met Arg Gly Arg His Leu lie Tyr Leu Arg Tyr Trp 
180 185 190 

Tyr Asp Gly His Val Gly Ser Val Val Pro Ala Met Ser Phe Gly Tyr 
195 200 205 

Ser Ala Leu His Cys Gly lie Leu Asn Asn lie Val Val Leu Cys Cys 
210 215 220 

Ser Tyr Cys Ala Asp Leu Ser Glu lie Arg Val Arg Cys Cys Ala Arg 
225 230 235 240 

Arg Thr Arg Arg Leu Met Leu Arg Ala Val Arg He He Ala Glu Glu 

245 250 255 

Thr Thr Ala Met Leu Tyr Ser Cys Arg Thr Glu Arg Arg Arg Gin Gin 
260 265 270 

Phe lie Arg Ala Leu Leu Gin His His Arg Pro lie Leu Met His Asp 
275 280 285 

Tyr Asp Ser Thr Pro Met 
290 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5408 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CATCATCAAT AATATACCTT ATTTTGGATT GAAGCCAATA TGATAATGAG 50 

GGGGTGGAGT TTGTGACGTG GCGCGGGGCG TGGGAACGGG GCGGGTGACG 100 

TAGTAGTGTG GCGGAAGTGT GATGTTGCAA GTGTGGCGGA ACACATGTAA 150 

GCGACGGATG TGGCAAAAGT GACGTTTTTG GTGTGCGCCG GTGTACACAG 2 00 

GAAGTGACAA TTTTCGCGCG GTTTTAGGCG GATGTTGTAG TAAATTTGGG 2 50 

CGTAACCGAG TAAGATTTGG CCATTTTCGC GGGAAAACTG AATAAGAGGA 3 00 
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AGTGAAATCT GAATAATTTT GTGTTACTCA TAGCGCGTAA TATTTGTCTA 3 50 

GGGAGATCAG CCTGCAGGTC GTTACATAAC TTACGGTAAA TGGCCCGCCT 400 

GGCTGACCGC CCAACGACCC CCGCCCATTG ACGTCAATAA TGACGTATGT 450 

TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA TGGGTGGAGT 500 

ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA 550 

AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA 600 

TGCCCAGTAC ATGACCTTAT GGGACTTTCC TACTTGGCAG TACATCTACT 650 

CGAGGCCACG TTCTGCTTCA CTCTCCCCAT CTCCCCCCCC TCCCCACCCC 700 

CAATTTTGTA TTTATTTATT TTTTAATTAT TTTGTGCAGC GATGGGGGCG 750 

GGGGGGGGGG GGGGGCGCGC GCCAGGCGGG GCGGGGCGGG GCGAGGGGCG 800 

GGGCGGGGCG AGGCGGAGAG GTGCGGCGGC AGCCAATCAG AGCGGCGCGC 850 

TCCGAAAGTT TCCTTTTATG GCGAGGCGGC GGCGGCGGCG GCCCTATAAA 900 

AAGCGAAGCG CGCGGCGGGC GGGAGCGGGA TCAGCCACCG CGGTGGCGGC 950 

CGCAATTCCC GGGGATCGAA AGAGCCTGCT AAAGCAAAAA AGAAGTCACC 1000 

ATGTCGTTTA CTTTGACCAA CAAGAACGTG ATTTTCGTTG CCGGTCTGGG 1050 

AGGCATTGGT CTGGACACCA GCAAGGAGCT GCTCAAGCGC GATCCCGTCG 1100 

TTTTACAACG TCGTGACTGG GAAAACCCTG GCGTTACCCA ACTTAATCGC 1150 

CTTGCAGCAC ATCCCCCTTT CGCCAGCTGG CGTAATAGCG AAGAGGCCCG 1200 

CACCGATCGC CCTTCCCAAC AGTTGCGCAG CCTGAATGGC GAATGGCGCT 1250 

TTGCCTGGTT TCCGGCACCA GAAGCGGTGC CGGAAAGCTG GCTGGAGTGC 1300 

GATCTTCCTG AGGCCGATAC TGTCGTCGTC CCCTCAAACT GGCAGATGCA 13 50 

CGGTTACGAT GCGCCCATCT ACACCAACGT AACCTATCCC ATTACGGTCA 1400 

ATCCGCCGTT TGTTCCCACG GAGAATCCGA CGGGTTGTTA CTCGCTCACA 1450 

TTTAATGTTG ATGAAAGCTG GCTACAGGAA GG CCAGACGC GAATTATTTT 1500 

TGATGGCGTT AACTCGGCGT TTCATCTGTG GTGCAACGGG CGCTGGGTCG 1550 

GTTACGGCCA GGACAGTCGT TTGCCGTCTG AATTTGACCT GAGCGCATTT 1600 
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TTACGCGCCG GAGAAAACCG 
CGGCAGTTAT CTGGAAGATC 
GTGACGTCTC GTTGCTGCAT 
GTTGCCACTC GCTTTAATGA 
AGTTCAGATG TGCGGCGAGT 
TATGGCAGGG TGAAACGCAG 
GAAATTATCG ATGAG CGTGG 
GAACGTCGAA AACCCGAAAC 
GTGCGGTGGT TGAACTGCAC 
GCCTGCGATG TCGGTTTCCG 
GCTGAACGGC AAGCCGTTGC 
ATCCTCTGCA TGGTCAGGTC 
CTGCTGATGA AGCAGAACAA 
GAACCATCCG CTGTGGTACA 
TGGATGAAGC CAATATTGAA 
ACCGATGATC CGCGCTGGCT 
GGTGCAGCGC GATCGTAATC 
ATGAATCAGG CCACGGCGCT 
TCTGTCGATC CTTCCCGCCC 
CACGGCCACC GATATTATTT 
AGCCCTTCCC GGCTGTGCCG 
CCTGGAGAGA CGCGCCCGCT 
TAACAGTCTT GGCGGTTTCG 
CCCGTTTACA GGGCGGCTTC 
AAATATGATG AAAACGGCAA 
CGATACGCCG AACGATCGCC 



75 

CCTCGCGGTG ATGGTGCTGC 
AGGATATGTG GCGGATGAGC 
AAACCGACTA CACAAATCAG 
TGATTTCAGC CGCGCTGTAC 
TGCGTGACTA CCTACGGGTA 
GTCGCCAGCG GCACCGCGCC 
TGGTTATGCC GATCGCGTCA 
TCTGGAGCGC CGAAATCCCG 
ACCGCCGACG GCACGCTGAT 
CGAGGTGCGG ATTGAAAATG 
TGATT CGAGG CGTTAAC CGT 
ATGGATGAGC AGACGATGGT 
CTTTAACGCC GTGCGCTGTT 
CGCTGTGCGA CCGCTACGGC 
ACCCACGGCA TGGTGCCAAT 
ACCGGCGATG AGCGAACGCG 
ACCCGAGTGT GATCATCTGG 
AATCACGACG CGCTGTATCG 
GGTGCAGTAT GAAGGCGGCG 
GCCCGATGTA CGCGCGCGTG 
AAATGGTCCA TCAAAAAATG 
GATCCTTTGC GAATACGCCC 
CTAAATACTG GCAGGCGTTT 
GTCTGGGACT GGGTGGATCA 
CCCGTGGTCG GCTTACGGCG 
AGTTCTGTAT GAACGGTCTG 



GTTGGAGTGA 1650 

GGCATTTTCC 1700 

CGATTTCCAT 1750 

TGGAGGCTGA 1800 

ACAGTTTCTT 1850 

TTTCGGCGGT 1900 

CACTACGTCT 1950 

AATCTCTATC 2 000 

TGAAGCAGAA 2 050 

GTCTGCTGCT 2100 

CACGAGCATC 2150 

GCAGGATATC 2200 

CGCATTATCC 2250 

CTGTATGTGG 23 00 

GAATCGTCTG 2 3 50 

TAACGCGAAT 24 00 

TCGCTGGGGA 2 450 

CTGGATCAAA 2500 

GAGCCGACAC 2 550 

GATGAAGACC 2 600 

GCTTTCGCTA 2 650 

ACGCGATGGG 27 00 

CGTCAGTATC 2750 

GTCGCTGATT 2800 

GTGATTTTGG 2850 

GTCTTTGCCG 2900 
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ACCGCACGCC 


GCATCCAGCG 


CTGACGGAAG 


CAAAACACCA 


GCAGCAGTTT 


2950 


TTCCAGTTCC 


GTTTATCCGG 


GCAAACCATC 


G AAGTG AC C A 


GCGAATACCT 


3000 


GTTCCGTCAT 


AGCGATAACG 


AGCTCCTGCA 


CTGGATGGTG 


GCGCTGGATG 


3050 


GTAAGCCGCT 


GGCAAGCGGT 


GAAGTGCCTC 


TGGATGTCGC 


TCCACAAGGT 


3100 


AAACAGTTGA 


TTGAACTGCC 


TGAACTACCG 


CAGCCGGAGA 


GCGCCGGGCA 


3150 


ACTCTGGCTC 


ACAGTACGCG 


TAGTGCAACC 


GAACGCGACC 


GCATGGTCAG 


3200 


AAGCCGGGCA 


CATCAGCGCC 


TGGCAGCAGT 


GGCGTCTGGC 


GGAAAACCTC 


3250 


AGTGTGACGC 


TCCCCGCCGC 


GTCCCACGCC 


ATCCCGCATC 


TGACCACCAG 


3300 


CGAAATGGAT 


TTTTGCATCG 


AGCTGGGTAA 


TAAGCGTTGG 


CAATTTAACC 


3350 


GCCAGTCAGG 


CTTTCTTTCA 


CAGATGTGGA 


TTGGCGATAA 


AAAACAACTG 


3400 


CTGACGCCGC 


TGCGCGATCA 


GTTCACCCGT 


GCACCGCTGG 


ATAACGACAT 


3450 


TGGCGTAAGT 


GAAGCGACCC 


GCATTGACCC 


TAACGCCTGG 


GTCGAACGCT 


3500 


GGAAGGCGGC 


GGGCCATTAC 


CAGGCCGAAG 


CAGCGTTGTT 


GCAGTGCACG 


3550 


GCAGATACAC 


TTGCTGATGC 


GGTGCTGATT 


ACGACCGCTC 


ACGCGTGGCA 


3600 


GCATCAGGGG 


AAAACCTTAT 


TTATCAGCCG 


GAAAAC CT AC 


CGGATTGATG 


3650 


GTAGTGGTCA 


AATGGCGATT 


ACCGTTGATG 


TTGAAGTGGC 


GAGCGATACA 


3700 


CCGCATCCGG 


CGCGGATTGG 


CCTGAACTGC 


CAGCTGGCGC 


AGGTAGCAGA 


3750 


GCGGGTAAAC 


TGGCTCGGAT 


TAGGGCCGCA 


AGAAAACTAT 


CCCGACCGCC 


3800 


TTACTGCCGC 


CTGTTTTGAC 


CGCTGGGATC 


TGCCATTGTC 


AGACATGTAT 


3850 


ACCCCGTACG 


TCTTCCCGAG 


CGAAAACGGT 


CTGCGCTGCG 


GGACGCGCGA 


3900 


ATTGAATTAT 


GGCCCACACC 


AGTGGCGCGG 


CGACTTCCAG 


TTCAACATCA 


3950 


GCCGCTACAG 


TCAACAGCAA 


CTGATGGAAA 


CCAGCCATCG 


CCATCTGCTG 


4000 


CACGCGGAAG 


AAGGCACATG 


GCTGAATATC 


GACGGTTTCC 


ATATGGGGAT 


4050 


TGGTGGCGAC 


GACTCCTGGA 


GCCCGTCAGT 


ATCGGCGGAA 


TTACAGCTGA 


4100 


GCGCCGGTCG 


CTACCATTAC 


CAGTTGGTCT 


GGTGTCAAAA 


ATAATAATAA 


4150 


CCGGGCAGGC 


CATGTCTGCC 


CGTATTTCGC 


GTAAGGAAAT 


CCATTATGTA 


4200 
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CTATTTAAAA AACACAAACT TTTGGATGTT 
ACTTTTTTAT CATGGGAGCC TACTTCCCGT 
GACATCAACC ATATCAGCAA AAGTGATACG 
TTCTCTGTTC TCGCTATTAT TCCAACCGCT 
AACTCGGCCT CGACTCTAGG CGGCCGCGGG 
CATTGATGAG TTTGGACAAA CCACAACTAG 
TTATTTGTGA AATTTGTGAT GCTATTGCTT 
TGCAATAAAC AAGTTAACAA CAACAATTGC 
TCAGGGGGAG GTGTGGGAGG TTTTTTCGGA 
GGCTGATCAG TGGAAGGTGC TGAGGTACGA 
GACCCTGCGA GTGTGGCGGT AAACATATTA 
GATGTGACCG AGGAGCTGAG GCCCGATCAC 
CGCTGAGTTT GGCTCTAGCG ATGAAGATAC 
GTGGGCGTGG CTTAAGGGTG GGAAAGAATA 
TAGTTTTGTA TCTGTTTTGC AGCAGCCGCC 
GTTTGATGGA AGCATTGTGA GCTCATATTT 
GGGCCGGGGT GCGTCAGAAT GTGATGGGCT 
GTCCTGCCCG CAAACTCTAC TACCTTGACC 
GCCGTTGGAG ACTGCAGCCT CCGCCGCCGC 
CCCGCGGGAT TGTGACTGAC TTTGCTTTCC 
GCAGCTTCCC GTTCATCCGC CCGCGATGAC 
ACAATTGGAT TCTTTGACCC GGGAACTTAA 
TGGATCTGCG CCAGCAGGTT TCTGCCCTGA 
GCGGTTTAAA ACATAAATAA AAAACCAGAC 
GCAAGTGTCT TGCTGTCTTT ATTTAGGGGT 
GGGACCAGCG GTCTCGGTCG TTGAGGGTCC 



CGGTTTATTC 


TTTTTCTTTT 


4250 


TTTTCCCGAT 


TTGGCTACAT 


4300 


GGTATTATTT 


TTGCCGCTAT 


4350 


GTTTGGTCTG 


CTTTCTGACA 


4400 


GATCCAGACA 


TGATAAGATA 


4450 


AATGCAGTGA 


AAAAAATGCT 


4500 


TATTTGTAAC 


CATTATAAGC 


4550 


ATTCATTTTA 


TGTTTCAGGT 


4 600 


TCCTCTAGAG 


TCGACCTGCA 


4 650 


TGAGACCCGC 


ACCAGGTGCA 


4700 


GGAACCAGCC 


TGTGATGCTG 


4750 


TTGGTGCTGG 


CCTGCACCCG 


4 8 00 


AGATTGAGGT 


ACTGAAATGT 


4 850 


TATAAGGTGG 


GGGTCTTATG 


4900 


GCCGCCATGA 


GCACCAACTC 


4950 


GACAACGCGC 


ATGCCCCCAT 


5000 


CCAGCATTGA 


TGGTCGCCCC 


5050 


TACGAGACCG 


TGTCTGGAAC 


5100 


TTCAGCCGCT 


GCAGCCACCG 


5150 


TGAGCCCGCT 


TGCAAGCAGT 


5200 


AAGTTGACGG 


CTCTTTTGGC 


5250 


TGTCGTTTCT 


CAGCAGCTGT 


5300 


AGGCTTCCTC 


CCCTCCCAAT 


5350 


TCTGTTTGGA 


TTTGGATCAA 


5400 


TTTGCGCGCG 


CGGTAGGCCC 


5450 


TGTGTATTTT 


TTCCAGGACG 


5500 
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TGGTAAAGGT GACTCTGGAT GTTCAGATAC 
GGGGTGGAGG TAGCACCACT GCAGAGCTTC 
AGATGATCCA GTCGTAGCAG GAGCGCTGGG 
TTCAGTAGCA AGCTGATTGC CAGGGGCAGG 
AAAGCGGTTA AGCTGGGATG GGTGCATACG 
TGGACTGTAT TTTTAGGTTG GCTATGTTCC 
TTCATGTTGT GCAGAACCAC CAGCACAGTG 
TTTGTCATGT AGCTTAGAAG GAAATGCGTG 
TGTGACCTCC AAGATTTTCC ATGCATTCGT 
CCACGGGCGG CGGCCTGGGC GAAGATATTT 
GTTGTGTTCC AGGATGAGAT CGTCATAGGC 
GGAGGGTGCC AGACTGCGGT ATAATGGTTC 
TTACCCTCAC AGATTTGCAT TTCCCACGCT 
CATGTCTACC TGCGGGGCGA TGAAGAAAAC 
TCAGCTGGGA AGAAAGCAGG TTCCTGAGCA 
GTGGGCCCGT AAATCACACC TATTACCGGG 
GCTGCAGCTG CCGTCATCCC TGAGCAGGGG 
CCCTGACTCG CATGTTTTCC CTGACCAAAT 
CCCAGCGATA GCAGTTCTTG CAAGGAAGCA 
ACCGTCCGCC GTAGGCATGC TTTTGAGCGT 
GGTCCCACAG CTCGGTCACC TGCTCTACGG 
CCTCGTTTCG CGGGTTGGGG CGGCTTTCGC 
TCGTCCAGAC GGGCCAGGGT CATGTCTTTC 
CAGCGTAGTC TGGGTCACGG TGAAGGGGTG 
CCAGGGTGCG CTTGAGGCTG GTCCTGCTGG 
TCGCCCTGCG CGTCGGCCAG GTAGCATTTG 



ATGGGCATAA GCCCGTCTCT 5550 

ATGCTGCGGG GTGGTGTTGT 5600 

CGTGGTGCCT AAAAATGTCT 5650 

CCCTTGGTGT AAGTGTTTAC 5700 

TGGGGATATG AGATGCATCT 5750 

CAGCCATATC CCTCCGGGGA 5800 

TATCCGGTGC ACTTGGGAAA 5850 

GAAGAACTTG GAGACGCCCT 59 00 

CCATAATGAT GGCAATGGGC 5950 

CTGGGATCAC TAACGTCATA 6000 

CATTTTTACA AAGCGCGGGC 6050 

CATCCGGCCC AGGGGCGTAG 6100 

TTGAGTTCAG ATGGGGGGAT 6150 

GGTTTCCGGG GTAGGGGAGA 62 00 

GCTGCGACTT ACCGCAGCCG 6250 

TGCAACTGGT AGTTAAGAGA 63 00 

GGCCACTTCG TTAAGCATGT 63 50 

CCGCCAGAAG GCGCTCGCCG 6400 

AAGTTTTTCA ACGGTTTGAG 6450 

TTGACCAAGC AGTTCCAGGC 6500 

CATCTCGATC C AG C AT ATCT 6550 

TGTACGGCAG TAGTCGGTGC 6600 

CACGGGCGCA GGGTCCTCGT 6650 

CGCTCCGGGC TGCGCGCTGG 67 00 

TGCTGAAGCG CTGCCGGTCT 6750 

ACCATGGTGT CATAGTCCAG 6800 
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CCCCTCCGCG GCGTGGCCCT TGGCGCGCAG 
CGCACGAGGG GCAGTGCAGA CTTTTGAGGG 
AATACCGATT CCGGGGAGTA GGCATCCGCG 
CTCGCATTCC ACGAGCCAGG TGAGCTCTGG 
GGTTTCCCCC ATGCTTTTTG ATGCGTTTCT 
CGGTGTCCAC GCTCGGTGAC GAAAAGGCTG 
CTTGAGAGGC CTGTCCTCGA GCGGTGTTCC 
ACTCGGACCA CTCTGAGACA AAGGCTCGCG 
GCTAAGTGGG AGGGGTAGCG GTCGTTGTCC 
CAGGGTGTGA AGACACATGT CGCCCTCTTC 
GTTTGTAGGT GTAGGCCACG TGACCGGGTG 
AAGGGGGTGG GGGGGCGTTC GTCCTCAGTC 
GAGGGCCAGC TGTTGGGGTG AGTACTCCCT 
CTGCGCTAAG ATTGTCAGTT TCCAAAAACG 
TGGCCCGCGG TGATGCCTTT GAGGGTGGCC 
GACAATCTTT TTGTTGTCAA GCTTGGTGGC 
TGGAC AG CAA CTTGGCGATG GAGCGCAGGG 
GCGCGCTCCT TGGCCGCGAT GTTTAGCTGC 
CCGCCATTCG GGAAAGACGG TGGTGCGCTC 
GCCAACCGCG GTTGTGCAGG GTGACAAGGT 
CCGCGTAGGC GCTCGTTGGT CCAGCAGAGG 
GAATGGCGGT AGGGGGTCTA GCTGCGTCTC 
CGGTAAAGAC CCCGGGCAGC AGGCGCGCGT 
CCTTGCAAGT CTAGCGCCTG CTGCCATGCG 
GTATGGGTTG AGTGGGGGAC CCCATGGCAT 
CGTACATGCC GCAAATGTCG TAAACGTAGA 



CTTGCCCTTG GAGGAGGCGC 6850 

CGTAGAGCTT GGGCGCGAGA 6900 

CCGCAGGCCC CGCAGACGGT 6950 

CCGTTCGGGG TCAAAAACCA 7000 

TACCTCTGGT TTCCATGAGC 7050 

TCCGTGTCCC CGTATACAGA 7100 

GCGGTCCTCC TCGTATAGAA 7150 

TCCAGGCCAG CACGAAGGAG 7200 

ACTAGGGGGT CCACTCGCTC 7250 

GGCATCAAGG AAGGTGATTG 7300 

TTCCTGAAGG GGGGCTATAA 7350 

TCTTCCGCAT CGCTGTCTGC 7400 

CTGAAAAGCG GGCATGACTT 74 50 

AGGAGGATTT GATATTCACC 7500: 

GCATCCATCT GGTCAGAAAA 7550~ 

AAACGACCCG TAGAGGGCGT 7600 

TTTGGTTTTT GTCGCGATCG 7650 

ACGTATTCGC GCGCAACGCA 7700 

GTCGGGCACC AGGTGCACGC 7750 

CAACGCTGGT GGCTACCTCT 7800 

CGGCCGCCCT TGCGCGAGCA 7850 

GTCCGGGGGG TCTGCGTCCA 7 900 

CGAAGTAGTC TATCTTGCAT 7950 

CGGGCGG CAA GCGCGCGCTC 8 000 

GGGGTGGGTG AGCGCGGAGG 8050 

GGGGCTCTCT GAGTATTCCA 8100 
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AGATATGTAG GGTAGCATCT TCCACCGCGG 
GTATAGTTCG TGCGAGGGAG CGAGGAGGTC 
CGGGCTGCTC TGCTCGGAAG ACTATCTGCC 
GATGATATGG TTGGACGCTG GAAGACGTTG 
TACCGCGTCA CGCACGAAGG AGGCGTAGGA 
GCTCGGCGGT GACCTGCACG TCTAGGGCGC 
ATGATGTCAT ACTTATCCTG TCCCTTTTTT 
GACAAACTCT TCGCGGTCTT TCCAGTACTC 
CCTCCGAACG GTAAGAGCCT AGCATGTAGA 
GCGCAGCATC CCTTTTCTAC GGGTAGCGCG 
GAGCGAGGTG TGGGTGAGCG CAAAGGTGTC 
ACTGGTATTT GAAGTCAGTG TCGTCGCATC 
AAGTCCGTGC GCTTTTTGGA ACGCGGATTT 
GTTGAAGAGT ATCTTTCCCG CGCGAGGCAT 
AGGGTCCCGG CACCTCGGAA CGGTTGTTAA 
ATCTCGTCAA AGCCGTTGAT GTTGTGGCCC 
GCGCGGGATG CCCTTGATGG AAGGCAATTT 
GCTCTTCAGG GGAGCTGAGC CCGTGCTCTG 
TGAGGGTTGG AAGCGACGAA TGAGCTCCAC 
TTGCAGGTGG TCGCGAAAGG TCCTAAACTG 
CTGGGGTGAT GCAGTAGAAG GTAAGCGGGT 
CCAAGGTTCG CGGCTAGGTC TCGCGCGGCA 
GCCGAACTTC ATGACCAGCA TGAAGGGCAC 
CCATCCAAGT ATAGGTCTCT ACATCGTAGG 
CGAGGATGCG AGCCGATCGG GAAGAACTGG 
GGAGTGGCTA TTGATGTGGT GAAAGTAGAA 



PCT/US96/10245 



ATGCTGGCGC GCACGTAATC 8150 

GGGACCGAGG TTGCTACGGG 8200 

TGAAGATGGC ATGTGAGTTG 8250 

AAGCTGGCGT CTGTGAGACC 83 00 

GTCGCGCAGC TTGTTGACCA 8350 

AGTAGTCCAG GGTTTCCTTG 8400 

TTCCACAGCT CGCGGTTGAG 8450 

TTGGATCGGA AACCCGTCGG 8500 

ACTGGTTGAC GGCCTGGTAG 8550 

TATGCCTGCG CGGCCTTCCG 8600 

CCTGACCATG ACTTTGAGGT 8650 

CGCCCTGCTC CCAGAGCAAA 8700 

GGCAGGGCGA AGGTGACATC 8750 

AAAGTTGCGT GTGATGCGGA 8800 

TTACCTGGGC GGCGAGCACG 8850 

ACAATGTAAA GTTCCAAGAA 8900 

TTTAAGTTCC TCGTAGGTGA 8950 

AAAGGGCCCA GTCTGCAAGA 9000 

AGGTCACGGG CCATTAGCAT 9 050 

GCGACCTATG GCCATTTTTT 9100 

CTTGTTCCCA GCGGTCCCAT 9150 

GTCACTAGAG GCTCATCTCC 92 00 

GAGCTGCTTC CCAAAGGCCC 92 50 

TGACAAAGAG ACGCTCGGTG 93 00 

ATCTCCCGCC ACCAATTGGA 93 50 

GTCCCTGCGA CGGGCCGAAC 9400 
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ACTCGTGCTG GCTTTTGTAA 
GGCTGTACAT CCTGCACGAG 
GAGTGGGAAT TTGAGCCCCT 
CTTCGGCTGC TTGTCCTTGA 
GATCGGACCA CCACGCCGCG 
CGGTCGGAGC TTGATGACAA 
GGAGCTCCCG CGGCGTCAGG 
CATAGACGGG TCAGGGCGCG 
GGGCTGGTTG GTGGCGGCGT 
GCGCGACTAC GGTACCGCGC 
GATGATGCAT CTAAAAGCGG 
GGCTCCGGAC CCGCCGGGAG 
GGGCAGGAGC TGGTGCTGCG 
GGCGGTTGAT CTCCTGAATC 
GTGAGCTTGA GCCTGAAAGA 
GACGGCGGCC TGGCGCAAAA 
AGGCGATCTC GGCCATGAAC 
CGTCCGGCTC GCTCCACGGT 
GAGCTGCGAG AAGGCGTTGA 
CCACGCCCCC TTCGGCATCG 
AGCTCCACGT GCCGGGCGAA 
GTAGTTGAGG GTGGTGGCGG 
AGCGTCGCAA CGTGGATTCG 
ATGGCCTCGT AGAAGTCCAC 
CGACACGGTT AACTCCTCCT 
CGCGCACCTC GCGCTCAAAG 



81 

AAACGTG CGC AGTACTGGCA 
GTTGACCTGA CGACCGCGCA 
CGCCTGGCGG GTTTGGCTGG 
CCGTCTGGCT GCTCGAGGGG 
CGAGCCCAAA GTCCAGATGT 
CATCGCGCAG ATGGGAGCTG 
TCAGGCGGGA GCTCCTGCAG 
GGCTAGATCC AGGTGATACC 
CGATGGCTTG CAAGAGGCCG 
GGCGGGCGGT GGGCCGCGGG 
TGACGCGGGC GAGCCCCCGG 
AGGGGGCAGG GGCACGTCGG 
CGCGTAGGTT GCTGGCGAAC 
TGGCGCCTCT GCGTGAAGAC 
GAGTTCGACA GAATCAATTT 
TCTCCTGCAC GTCTCCTGAG 
TGCTCGATCT CTTCCTCCTG 
GGCGGCGAGG TCGTTGGAAA 
GGCCTCCCTC GTTCCAGACG 
CGGGCGCGCA TGACCACCTG 
GACGGCGTAG TTTCGCAGGC 
TGTGTTCTGC CACGAAGAAG 
TTGATATCCC CCAAGGCCTC 
GGCGAAGTTG AAAAACTGGG 
CCAGAAGACG GATGAGCTCG 
GCTACAGGGG CCTCTTCTTC 



GCGGTGCACG 9450 
CAAGGAAGCA 9 5 00 
TGGTCTTCTA 9550 
AGTTACGGTG 9600 
CCGCGCGCGG 9650 
TCCATGGTCT 9700 
GTTTACCTCG 9750 
TAATTTCCAG 9800 
CATCCCCGCG 9850 
GGTGTCCTTG 9900 
AGGTAGGGGG 9950, 

CGCCGCGCGC 10000 

GCGACGACGC 1005 0 

GACGGGCCCG 1010 0 : 

CGGTGTCGTT 10150' 

TTGTCTTGAT 10200 

GAGATCTCCG 10250 

TGCGGGCCAT 10300 

CGGCTGTAGA 10350 

CGCGAGATTG 104 00 

GCTGAAAGAG 10450 

TACATAACCC 10500 

AAGGCGCTCC 10550 

AGTTGCGCGC 10600 

GCGACAGTGT 10650 

TTCTTCAATC 10700 
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TCCTCTTCCA TAAGGGCCTC CCCTTCTTCT 
AGGGGGGACA CGGCGGCGAC GACGGCGCAC 
GCTCGATCAT CTCCCCGCGG CGACGGCGCA 
CCGTTCTCGC GGGGGCGCAG TTGGAAGACG 
ATGGGTTGGC GGGGGGCTGC CATGCGGCAG 
ATCTCAACAA TTGTTGTGTA GGTACTCCGC 
TCCGCATCGA CCGGATCGGA AAACCTCTCG 
ACAGTCGCAA GGTAGGCTGA GCACCGTGGC 
CGGGGTTGTT TCTGGCGGAG GTGCTGCTGA 
GTCTTGAGAC GGCGGATGGT CGACAGAAGC 
CTGCTGAATG CGCAGGCGGT CGGCCATGCC 
GGCGCAGGTC TTTGTAGTAG TCTTGCATGA 
TCTTCTCCTT CCTCTTGTCC TGCATCTCTT 
GGCGGAGTTT GGCCGTAGGT GGCGCCCTCT 
CGAAGCCCCT CATGGGCTGA AGCAGGGCTA 
GCTAATATGG CCTGCTGCAC CTGCGTGAGG 
GTCCACAAAG CGGTGGTATG CGCCCGTGTT 
CCATAACGGA CCAGTTAACG GTCTGGTGAC 
TACCTGAGAC GCGAGTAAGC CCTCGAGTCA 
CCGCACCAGG TACTGGTATC CCACCAAAAA 
AGAGGGGCCA GCGTAGGGTG GCCGGGGCTC 
ATAAGGCGAT GATATCCGTA GATGTACCTG 
GGCGGTGGTG GAGGCGCGCG GAAAGTCGCG 
GCAGCGGCAA AAAGTGCTCC ATGGTCGGGA 
GCGCAATCGT TGACGCTCTA GACCGTGCAA 
CACTCTTCCG TGGTCTGGTG GATAAATTCG 



TCTTCTGGCG 


GCGGTGGGGG 


10750 


CGGGAGGCGG 


TCGACAAAGC 


10800 


TGGTCTCGGT 


GACGGCGCGG 


10850 


CCGCCCGTCA 


TGTCCCGGTT 


10900 


GGATACGGCG 


CTAACGATGC 


10950 


CGCCGAGGGA 


CCTGAGCGAG 


11000 


AGAAAGGCGT 


CTAACCAGTC 


11050 


GGGCGGCAGC 


GGGCGGCGGT 


11100 


TGATGTAATT 


AAAGTAGGCG 


11150 


ACCATGTCCT 


TGGGTCCGGC 


11200 


CCAGGCTTCG 


TTTTGACATC 


11250 


GCCTTTCTAC 


CGGCACTTCT 


11300 


GCATCTATCG 


CTGCGGCGGC 


11350 


TCCTCCCATG 


CGTGTGACCC 


11400 


GGTCGGCGAC 


AACGCGCTCG 


11450 


GTAGACTGGA 


AGTCATCCAT 


11500 


GATGGTGTAA 


GTGCAGTTGG 


11550 


CCGGCTGCGA 


GAGCTCGGTG 


11600 


AATACGTAGT 


CGTTGCAAGT 


11650 


GTGCGGCGGC 


GGCTGGCGGT 


11700 


CGGGGGCGAG 


ATCTTCCAAC 


11750 


GACATCCAGG 


TGATGCCGGC 


11800 


GACGCGGTTC 


CAGATGTTGC 


11850 


CGCTCTGGCC 


GGTCAGGCGC 


11900 


AAGGAGAGCC 


TGTAAGCGGG 


11950 


CAAGGGTATC 


ATGGCGGACG 


12000 
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ACCGGGGTTC GAGCCCCGTA TCCGGCCGTC CGCCGTGATC CATGCGGTTA 12 050 

CCGCCCGCGT GTCGAACCCA GGTGTGCGAC GTCAGACAAC GGGGGAGTGC 12100 

TCCTTTTGGC TTCCTTCCAG GCGCGGCGGC TGCTGCGCTA GCTTTTTTGG 12150 

CCACTGGCCG CGCGCAGCGT AAGCGGTTAG GCTGGAAAGC GAAAGCATTA 12 200 

AGTGGCTCGC TCCCTGTAGC CGGAGGGTTA TTTTCCAAGG GTTGAGTCGC 12250 

GGGACCCCCG GTTCGAGTCT CGGACCGGCC GGACTGCGGC GAACGGGGGT 12 3 00 

TTGCCTCCCC GTCATGCAAG ACCCCGCTTG CAAATTCCTC CGGAAACAGG 12 350 

GACGAGCCCC TTTTTTGCTT TTCCCAGATG CATCCGGTGC TGCGGCAGAT 12400 

GCGCCCCCCT CCTCAGCAGC GGCAAGAGCA AGAGCAGCGG CAGACATGCA 12450 

GGGCACCCTC CCCTCCTCCT ACCGCGTCAG GAGGGGCGAC ATCCGCGGTT 12500 

GACGCGGCAG CAGATGGTGA TTACGAACCC CCGCGGCGCC GGGCCCGGCA 12550 

CTACCTGGAC TTGGAGGAGG GCGAGGGCCT GGCGCGGCTA GGAGCGCCCT 12 600 

CTCCTGAGCG GTACCCAAGG GTGCAGCTGA AGCGTGATAC GCGTGAGGCG 12 650 

TACGTGCCGC GGCAGAACCT GTTTCGCGAC CGCGAGGGAG AGGAGCCCGA 12700 

GGAGATGCGG GAT CGAAAGT TCCACGCAGG GCGCGAGCTG CGGCATGGCC 12750 

TGAATCGCGA GCGGTTGCTG CGCGAGGAGG ACTTTGAGCC CGACGCGCGA 12800- 

ACCGGGATTA GTCCCGCGCG CGCACACGTG GCGGCCGCCG ACCTGGTAAC 12850 ' 

CGCATACGAG CAGACGGTGA ACCAGGAGAT TAACTTTCAA AAAAGCTTTA 12900 

ACAACCACGT GCGTACGCTT GTGGCGCGCG AGGAGGTGGC TATAGGACTG 12 950 

ATGCATCTGT GGGACTTTGT AAGCGCGCTG GAGCAAAACC CAAATAGCAA 13 000 

GCCGCTCATG GCGCAGCTGT TCCTTATAGT GCAGCACAGC AGGGACAACG 13 050 

AGG C ATT C AG GGATGCGCTG CTAAACATAG TAGAGCCCGA GGGCCGCTGG 13100 

CTGCTCGATT TGATAAACAT CCTGCAGAGC ATAGTGGTGC AGGAGCGCAG 13150 

CTTGAGCCTG GCTGACAAGG TGGCCGCCAT CAACTATTCC ATGCTTAGCC 132 00 

TGGGCAAGTT TTACGCCCGC AAGATATACC ATACCCCTTA CGTTCCCATA 13 250 

GACAAGGAGG TAAAGATCGA GGGGTTCTAC ATGCGCATGG CGCTGAAGGT 13 3 00 



WO 96/39530 

84 

GCTTACCTTG AGCGACGACC TGGGCGTTTA 
AGGCCGTGAG CGTGAGCCGG CGGCGCGAGC 
CACAGCCTGC AAAGGGCCCT GGCTGGCACG 
CGAGTCCTAC TTTGACGCGG GCGCTGACCT 
GCGCCCTGGA GGCAGCTGGG GCCGGACCTG 
CGCGCTGGCA ACGTCGGCGG CGTGGAGGAA 
CGAGCCAGAG GACGGCGAGT ACTAAGCGGT 
GCAAGACGCA ACGGACCCGG CGGTGCGGGC 
CCGGCCTTAA CTCCACGGAC GACTGGCGCC 
TCGCTGACTG CGCGCAATCC TGACGCGTTC 
CCGGCTCTCC GCAATTCTGG AAGCGGTGGT 
CGCACGAGAA GGTGCTGGCG ATCGTAAACG 
ATCCGGCCCG ACGAGGCCGG CCTGGTCTAC 
GGCTCGTTAC AACAGCGGCA ACGTGCAGAC 
GGGATGTGCG CGAGGCCGTG GCGCAGCGTG 
AACCTGGGCT CCATGGTTGC ACTAAACGCC 
CAACGTGCCG CGGGGACAGG AGGACTACAC 
GGCTAATGGT GACTGAGACA CCGCAAAGTG 
GACTATTTTT TCCAGACCAG T AG AC AAGG C 
CCAGGCTTTC AAAAACTTGC AGGGGCTGTG 
GCGACCGCGC GACCGTGTCT AGCTTGCTGA 
CTGCTGCTAA TAGCGCCCTT CACGGACAGT 
ATACCTAGGT CACTTGCTGA CACTGTACCG 
ATGTGGACGA GCATACTTTC CAGGAGATTA 
GGGCAGGAGG ACACGGGCAG CCTGGAGGCA 
CAACCGGCGG CAGAAGATCC CCTCGTTGCA 
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TCGCAACGAG CGCATCCACA 13350 

TCAGCGACCG CGAGCTGATG 13400 

GGCAGCGGCG ATAGAGAGGC 13450 

GCGCTGGGCC CCAAGCCGAC 13500 

GGCTGGCGGT GGCACCCGCG 13550 

TATGACGAGG ACGATGAGTA 13 600 

GATGTTTCTG ATCAGATGAT 13 650 

GGCGCTGCAG AGCCAGCCGT 13 700 

AGGTCATGGA CCGCATCATG 13750 

CGGCAGCAGC CGCAGGCCAA 13800 

CCCGGCGCGC GCAAACCCCA 13850 

CGCTGGCCGA AAACAGGGCC 13 900 

GACGCGCTGC TTCAGCGCGT 13950 

CAACCTGGAC CGGCTGGTGG 14000 

AGCGCGCGCA GCAGCAGGGC 14 050 

TTCCTGAGTA CACAGCCCGC 14100 

CAACTTTGTG AGCGCACTGC 14150 

AGGTGTACCA GTCTGGGCCA 142 00 

CTGCAGACCG TAAACCTGAG 14250 

GGGGGTGCGG GCTCCCACAG 14300 

CGCCCAACTC GCGCCTGTTG 143 50 

GGCAGCGTGT CCCGGGACAC 14400 

CGAGGCCATA GGTCAGGCGC 14450 

C AAGTGT C AG CCGCGCGCTG 14500 

ACCCTAAACT ACCTGCTGAC 14550 

CAGTTTAAAC AGCGAGGAGG 14600 
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AGCGCATTTT GCGCTACGTG 
GACGGGGTAA CGCCCAGCGT 
ACCGGGCATG TATGCCTCAA 
ACTTGCATCG CGCGGCCGCC 
TTGAACCCGC ACTGGCTACC 
GGTGCCCGAG GGTAACGATG 
TGTTTTCCCC GCAACCGCAG 
GCAGAGGCGG CGCTGCGAAA 
CGATCTAGGC GCTGCGGCCC 
GCTTGATAGG GTCTCTTACC 
GGCGAGGAGG AGTACCTAAA 
AAACCTGCCT CCGGCATTTC 
AGATGAGTAG ATGGAAGACG 
CCGCGCCCGC CCACCCGTCG 
GTGGGAGGAC GATGACTCGG 
GGAGTGGCAA CCCGTTTGCG 
TAAAAAAAAA AAAGCATGAT 
ACCGAGCGTT GGTTTTCTTG 
GTATGAGGAA GGTCCTCCTC 
CAGTGGCGGC GGCGCTGGGT 
TTTGTGCCTC CGCGGTACCT 
TTACTCTGAG TTGGCACCCC 
ACAACAAGTC AACGGATGTG 
AACTTTCTGA CCACGGTCAT 
AAGCACACAG ACCATCAATC 
TGAAAACCAT CCTGCATACC 



85 

CAGCAGAGCG TGAGCCTTAA CCTGATGCGC 14 650 

GGCGCTGGAC ATGACCGCGC GCAACATGGA 147 00 

ACCGGCCGTT TATCAACCGC CTAATGGACT 14750 

GTGAACCCCG AGTATTTCAC CAATGCCATC 14 8 00 

GCCCCCTGGT TTCTACACCG GGGGATTCGA 14850 

GATTCCTCTG GGACGACATA GACGACAGCG 149 00 

ACCCTGCTAG AGTTGCAACA GCGCGAGCAG 14950 

GGAAAGCTTC CGCAGGCCAA GCAGCTTGTC 15000 

CGCGGTCAGA TGCTAGTAGC CCATTTCCAA 15050 

AGCACTCGCA CCACCCGCCC GCGCCTGCTG 15100 

CAACTCGCTG CTGCAGCCGC AGCGCGAAAA 15150 

CCAACAACGG GATAGAGAGC CTAGTGGACA 152 00 

TACGCGCAGG AGCACAGGGA CGTGCCAGGC 15250 

TCAAAGGCAC GACCGTCAGC GGGGTCTGGT 153 00 

CAGACGACAG CAGCGTCCTG GATTTGGGAG 15 3 50 

CACCTTCGCC CCAGGCTGGG GAGAATGTTT 15400 

GCAAAATAAA AAACTCACCA AGGCCATGGC 15450 

TATTCCCCTT AGTATGCGGC GCGCGGCGAT 15500 

CCTCCTACGA GAGTGTGGTG AGCGCGGCGC 15550 

TCTCCCTTCG ATGCTCCCCT GGACCCGCCG 15600 

GCGGCCTACC GGGGGGAGAA ACAGCATCCG 15650 

TATTCGACAC CACCCGTGTG TAC CTGGTGG 15700 

GCATCCCTGA ACTACCAGAA CGACCACAGC 15750 

TCAAAACAAT GACTACAGCC CGGGGGAGGC 15800 

TTGACGACCG GTCGCACTGG GGCGGCGACC 15850 

AACATGCCAA ATGTGAACGA GTTCATGTTT 15900 



WO 96/39530 



PCT/US96/10245 



86 



ACCAATAAGT 


TTAAGGCGCG 


GGTGATGGTG 


TCGCGCTTGC 


CTACTAAGGA 


15950 


CAATCAGGTG 


GAGCTGAAAT 


ACGAGTGGGT 


GGAGTTCACG 


CTGCCCGAGG 


16000 


GCAACTACTC 


CGAGACCATG 


ACCATAGACC 


TTATGAACAA 


CGCGATCGTG 


16050 


GAGCACTACT 


TGAAAGTGGG 


CAGACAGAAC 


GGGGTTCTGG 


AAAGCGACAT 


16100 


CGGGGTAAAG 


TTTGACACCC 


GCAACTTCAG 


ACTGGGGTTT 


GACCCCGTCA 


16150 


CTGGTCTTGT 


CATGCCTGGG 


GTATATACAA 


ACGAAGCCTT 


CCATCCAGAC 


16200 


ATCATTTTGC 


TGCCAGGATG 


CGGGGTGGAC 


TTCACCCACA 


GCCGCCTGAG 


16250 


CAACTTGTTG 


GGGATCCGCA 


AGCGGCAACC 


CTTCCAGGAG 


GGCTTTAGGA 


16300 


TCACCTACGA 


TGATCTGGAG 


GGTGGTAACA 


TTCCCGCACT 


GTTGGATGTG 


16350 


GACGCCTACC 


AGGCGAGCTT 


GAAAGATGAC 


ACCGAACAGG 


GCGGGGGTGG 


16400 


CGCAGGCGGC 


AGCAACAGCA 


GTGGCAGCGG 


CGCGGAAGAG 


AACTCCAACG 


16450 


CGGCAGCCGC 


GGCAATGCAG 


CCGGTGGAGG 


ACATGAACGA 


TCATGCCATT 


16500 


CGCGGCGACA 


CCTTTGCCAC 


ACGGGCTGAG 


GAGAAGCGCG 


CTGAGGCCGA 


16550 


AGCAGCGGCC 


GAAGCTGCCG 


CCCCCGCTGC 


GCAACCCGAG 


GTCGAGAAGC 


16600 


CTCAGAAGAA 


ACCGGTGATC 


AAACCCCTGA 


CAGAGGACAG 


CAAGAAACGC 


16650 


AGTTACAACC 


TAATAAGCAA 


TGACAGCACC 


TTCACCCAGT 


ACCGCAGCTG 


16700 


GTACCTTGCA 


TACAACTACG 


GCGACCCTCA 


GACCGGAATC 


CGCTCATGGA 


16750 


CCCTGCTTTG 


CACTCCTGAC 


GTAACCTGCG 


GCTCGGAGCA 


GGTCTACTGG 


16800 


TCGTTGCCAG 


ACATGATGCA 


AGACCCCGTG 


ACCTTCCGCT 


CCACGCGCCA 


16850 


GATCAGCAAC 


TTTCCGGTGG 


TGGGCGCCGA 


GCTGTTGCCC 


GTGCACTCCA 


16900 


AGAGCTTCTA 


CAACGACCAG 


GCCGTCTACT 


w w wnAs« x x 


^#W\J\#^*AW X. X X 


± \j u yj 


ACCTCTCTGA 


CCCACGTGTT 


CAATCGCTTT 


CCCGAGAACC 


AGATTTTGGC 


17000 


GCGCCCGCCA 


GCCCCCACCA 


TCACCACCGT 


CAGTGAAAAC 


GTTCCTGCTC 


17050 


TCACAGATCA 


CGGGACG CT A 


CCGCTGCGCA 


ACAGCATCGG 


AGGAGTCCAG 


17100 


CGAGTGACCA 


TTACTGACGC 


CAGACGCCGC 


ACCTGCCCCT 


ACGTTTACAA 


17150 


GGCCCTGGGC 


ATAGTCTCGC 


CGCGCGTCCT 


ATCGAGCCGC 


ACTTTTTGAG 


17200 
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CAAGCATGTC 


CATCCTTATA 


TCGCCCAGCA 


ATAACACAGG 


CTGGGGCCTG 


17250 


CGCTTCCCAA 


GCAAGATGTT 


TGGCGGGGCC 


AAGAAGCGCT 


CCGACCAACA 


17300 


CCCAGTGCGC 


GTGCGCGGGC 


ACTACCGCGC 


GCCCTGGGGC 


GCGCACAAAC 


17350 


GCGGCCGCAC 


TGGGCGCACC 


ACCGTCGATG 


ACGCCATCGA 


CGCGGTGGTG 


17400 


GAGGAGGCGC 


GCAACTACAC 


GCCCACGCCG 


CCACCAGTGT 


CCACAGTGGA 


17450 


CGCGGCCATT 


CAGACCGTGG 


TGCGCGGAGC 


CCGGCGCTAT 


GCTAAAATGA 


17500 


AGAGACGGCG 


GAGGCGCGTA 


GCACGTCGCC 


ACCGCCGCCG 


ACCCGGCACT 


17550 


GCCGCCCAAC 


GCGCGGCGGC 


GGCCCTGCTT 


AACCGCGCAC 


GTCGCACCGG 


17600 


CCGACGGGCG 


GCCATGCGGG 


CCGCTCGAAG 


GCTGGCCGCG 


GGTATTGTCA 


17650 


CTGTGCCCCC 


CAGGTCCAGG 


CGACGAGCGG 


CCGCCGCAGC 


AGCCGCGGCC 


17700 


ATTAGTGCTA 


TGACTCAGGG 


TCGCAGGGGC 


AACGTGTATT 


GGGTGCGCGA 


17750 


CTCGGTTAGC 


GGCCTGCGCG 


TGCCCGTGCG 


CACCCGCCCC 


CCGCGCAACT 


17800 


AGATTGCAAG 


AAAAAACTAC 


TTAGACTCGT 


ACTGTTGTAT 


GTATCCAGCG 


17850 


GCGGCGG CGC 


GCAACGAAGC 


TATGTCCAAG 


CGCAAAATCA 


AAGAAGAGAT 


17900 


GCTCCAGGTC 


ATCGCGCCGG 


AGATCTATGG 


CCCCCCGAAG 


AAGGAAGAGC 


17950 


AGGATTACAA 


GCCCCGAAAG 


CTAAAGCGGG 


TCAAAAAGAA 


AAAGAAAGAT 


18000 


GATGATGATG 


AACTTGACGA 


CGAGGTGGAA 


CTGCTGCACG 


CTACCGCGCC 


18050 


CAGGCGACGG 


GTACAGTGGA 


AAGGTCGACG 


CGTAAAAGGT 


GTTTTGCGAC 


18100 


CCGGCACCAC 


CGTAGTCTTT 


ACGCCCGGTG 


AGCGCTCCAC 


CCGCACCTAC 


18150 


AAGCGCGTGT 


ATGATGAGGT 


GTACGGCGAC 


GAGGACCTGC 


TTGAGCAGGC 


18200 


p a a ccz & acne 


Vm» X, VwVjVjVJVJAV? -L 






AAGGACATGC 


18250 


TGGCGTTGCC 


GCTGGACGAG 


GGCAACCCAA 


CACCTAGCCT 


AAAGCCCGTA 


18300 


ACACTGCAGC 


AGGTGCTGCC 


CGCGCTTGCA 


CCGTCCGAAG 


AAAAGCGCGG 


18350 


CCTAAAGCGC 


GAGTCTGGTG 


ACTTGGCACC 


CACCGTGCAG 


CTGATGGTAC 


18400 


CCAAGCGCCA 


GCGACTGGAA 


GATGTCTTGG 


AAAAAATGAC 


CGTGGAACCT 


18450 


GGGCTGGAGC 


CCGAGGTCCG 


CGTGCGGCCA 


ATCAAGCAGG 


TGGCGCCGGG 


18500 
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ACTGGGCGTG 


CAGACCGTGG 


ACGTTCAGAT 


ACCCACTACC 


AGTAGCACCA 


18550 


GTATTGCCAC 


CGCCACAGAG 


GGCATGGAGA 


CACAAACGTC 


CCCGGTTGCC 


18600 


TCAGCGGTGG 


CGGATGCCGC 


GGTGCAGGCG 


GTCGCTGCGG 


CCGCGTCCAA 


18650 


GACCTCTACG 


GAGGTGCAAA 


CGGACCCGTG 


GATGTTTCGC 


GTTTCAGCCC 


18700 


CCCGGCGCCC 


GCGCGGTTCG 


AGGAAGTACG 


GCGCCGCCAG 


CGCGCTACTG 


18750 


CCCGAATATG 


CCCTACATCC 


TTCCATTGCG 


CCTACCCCCG 


GCTATCGTGG 


18800 


CTACACCTAC 


CGCCCCAGAA 


GACGAGCAAC 


TACCCGACGC 


CGAACCACCA 


18850 


CTGGAACCCG 


CCGCCGCCGT 


CGCCGTCGCC 


AGCCCGTGCT 


GGCCCCGATT 


18900 


TCCGTGCGCA 


GGGTGGCTCG 


CGAAGGAGGC 


AGGACCCTGG 


TGCTGCCAAC 


18950 


AGCGCGCTAC 


CACCCCAGCA 


TCGTTTAAAA 


GCCGGTCTTT 


GTGGTTCTTG 


19000 


CAGATATGGC 


CCTCACCTGC 


CGCCTCCGTT 


TCCCGGTGCC 


GGGATTCCGA 


19050 


GGAAGAATGC 


ACCGTAGGAG 


GGGCATGGCC 


GGCCACGGCC 


TGACGGGCGG 


19100 


CATGCGTCGT 


GCGCACCACC 


GGCGGCGGCG 


CGCGTCGCAC 


CGTCGCATGC 


19150 


GCGGCGGTAT 


CCTGCCCCTC 


CTTATTCCAC 


TGATCGCCGC 


GGCGATTGGC 


19200 


GCCGTGCCCG 


GAATTGCATC 


CGTGGCCTTG 


CAGGCGCAGA 


GACACTGATT 


19250 


AAAAACAAGT 


TGCATGTGGA 


AAAATCAAAA 


TAAAAAGTCT 


GGACTCTCAC 


19300 


GCTCGCTTGG 


TCCTGTAACT 


ATTTTGTAGA 


ATGGAAGACA 


TCAACTTTGC 


19350 


GTCTCTGGCC 


CCGCGACACG GCTCGCGCCC 


GTTCATGGGA 


AACTGGCAAG 


19400 


ATATCGGCAC 


CAGCAATATG 


AGCGGTGGCG 


CCTTCAGCTG 


GGGCTCGCTG 


19450 


TGGAGCGGCA 


TTAAAAATTT 


CGGTTCCACC 


GTTAAGAACT 


ATGGCAGCAA 


19500 


GGCCTGGAAC 


AGCAGCACAG 


GCCAGATGCT 


GAGGGATAAG 


TTGAAAGAGC 


19550 


AAAATTTCCA 


ACAAAAGGTG 


GTAGATGGCC 


TGGCCTCTGG 


CATTAGCGGG 


19600 


GTGGTGGACC 


TGGCCAACCA 


GGCAGTGCAA 


AATAAGATTA 


ACAGTAAGCT 


19650 


TGATCCCCGC 


CCTCCCGTAG 


AGGAGCCTCC 


ACCGGCCGTG 


GAGACAGTGT 


19700 


CTCCAGAGGG 


GCGTGGCGAA 


AAGCGTCCGC 


GCCCCGACAG 


GGAAGAAACT 


19750 


CTGGTGACGC 


AAATAGACGA 


GCCTCCCTCG 


TACGAGGAGG 


CACTAAAGCA 


19800 
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AGGCCTGCCC ACCACCCGTC CCATCGCGCC 
GCCAGCACAC ACCCGTAACG CTGGACCTGC 
CAGAAACCTG TGCTGCCAGG CCCGACCGCC 
CCGCGCGTCC CTGCGCCGCG CCGCCAGCGG 
TAGCCAGTGG CAACTGGCAA AGCACACTGA 
GTGCAATCCC TGAAGCGCCG ACGATGCTTC 
TGTGTGTCAT GTATGCGTCC ATGTCGCCGC 
CGCGCGCCCG CTTTCCAAGA TGGCTACCCC 
CTTACATGCA CATCTCGGGC CAGGACGCCT 
CTGGTGCAGT TTGCCCGCGC CACCGAGACG 
GTTTAGAAAC CCCACGGTGG CGCCTACGCA 
GCCAGCGTTT GACGCTGCGG TTCATCCCTG 
TACTCGTAGA AGGCGCGGTT CACCCTAGCT 
GGACATGGCT TCCACGTACT TTGACATCCG 
CTACTTTTAA GCCCTACTCT GGCACTGCCT 
GGTGCCCCAA ATCCTTGCGA ATGGGATGAA 
AAACCTAGAA GAAGAGGACG ATGACAACGA 
CTGAGCAGCA AAAAACTCAC GTATTTGGGC 
AATATTACAA AGGAGGGTAT TCAAATAGGT 
ATATGCCGAT AAAACATTTC AACCTGAACC 
GGTACGAAAC TGAAATTAAT CATGCAGCTG 
ACCCCAATGA AACCATGTTA CGGTTCATAT 
TGGAGGGCAA GGCATTCTTG TAAAG C AAC A 
AAGTGGAAAT GCAATTTTTC TCAACTACTG 
GATAACTTGA CTCCTAAAGT GGTATTGTAC 
AACCCCAGAC ACTCATATTT CTTACATGCC 



CATGGCTACC 


GGAGTGCTGG 


19850 


CTCCCCCCGC 


CGACACCCAG 


19900 


GTTGTTGTAA 


CCCGTCCTAG 


19950 


TCCGCGATCG 


TTGCGGCCCG 


20000 


ACAGCATCGT 


GGGTCTGGGG 


20050 


TGAATAGCTA 


ACGTGTCGTA 


20100 


CAGAGGAGCT 


GCTGAGCCGC 


20150 


TTCGATGATG 


CCGCAGTGGT 


2 02 00 


CGGAGTACCT 


GAGCCCCGGG 


20250 


TACTTCAGCC 


TGAATAACAA 


20300 


CGACGTGACC 


ACAGACCGGT 


20350 


TGGACCGTGA 


GGATACTGCG 


20400 


GTGGGTGATA 


AC CGTGTGCT 


20450 


CGGCGTGCTG 


GACAGGGGCC 


20500 


ACAACGCCCT 


GGCTCCCAAG 


20550 


GCTGCTACTG 


CTCTTGAAAT 


20600 


AGACGAAGTA 


GACGAGCAAG 


20650 


AGGCGCCTTA 


TTCTGGTATA 


20700 


GTCGAAGGTC 


AAACACCTAA 


20750 


TCAAATAGGA 


GAATCTCAGT 


20800 






20850 


GCAAAACCCA 


CAAATGAAAA 


20900 


AAATGGAAAG 


CTAGAAAGTC 


20950 


AGGCGACCGC 


AGGCAATGGT 


21000 


AGTGAAGATG 


TAGATATAGA 


21050 


CACTATTAAG 


GAAGGTAACT 


21100 
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CACGAGAACT 


AATGGGCCAA 


CAATCTATGC 


CCAACAGGCC 


TAATTACATT 


21150 


GCTTTTAGGG 


ACAATTTTAT 


TGGTCTAATG 


TATTACAACA 


GCACGGGTAA 


21200 


TATGGGTGTT 


CTGGCGGGCC 


AAGCATCGCA 


GTTGAATGCT 


GTTGTAGATT 


21250 


TGCAAGACAG 


AAACACAGAG 


CTTTCATACC 


AGCTTTTGCT 


TGATTCCATT 


21300 


GGTGATAGAA 


CCAGGTACTT 


TTCTATGTGG 


AATCAGGCTG 


TTGACAGCTA 


21350 


TGATCCAGAT 


GTTAGAATTA 


TTGAAAATCA 


TGGAACTGAA 


GATGAACTTC 


21400 


CAAATTACTG 


CTTTCCACTG 


GGAGGTGTGA 


TTAATACAGA 


GACTCTTACC 


21450 


AAGGTAAAAC 


CTAAAACAGG 


TCAGGAAAAT 


GGATGGGAAA 


AAGATGCTAC 


21500 


AGAATTTTCA 


GATAAAAATG 


AAATAAGAGT 


TGGAAATAAT 


TTTGCCATGG 


21550 


AAATCAATCT 


AAATGCCAAC 


CTGTGGAGAA 


ATTTCCTGTA 


CTCCAACATA 


21600 


GCG CTGTATT 


TGCCCGACAA 


GCTAAAGTAC 


AGTCCTTCCA 


ACGTAAAAAT 


21650 


TTCTGATAAC 


CCAAACACCT 


ACGACTACAT 


GAACAAGCGA 


GTGGTGGCTC 


21700 


CCGGGTTAGT 


GGACTGCTAC 


ATTAACCTTG 


GAGCACGCTG 


GTCCCTTGAC 


21750 


TATATGGACA 


ACGTCAACCC 


ATTTAACCAC 


CACCGCAATG 


CTGGCCTGCG 


21800 


CTACCGCTCA 


ATGTTGCTGG 


GCAATGGTCG 


CTATGTGCCC 


TTCCACATCC 


21850 


AGGTGCCTCA 


GAAGTTCTTT 


GCCATTAAAA 


ACCTCCTTCT 


CCTGCCGGGC 


21900 


TCATACACCT 


ACGAGTGGAA 


CTTCAGGAAG 


GATGTTAACA 


TGGTTCTGCA 


21950 


GAGCTCCCTA 


GGAAATGACC 


TAAGGGTTGA 


CGGAGCCAGC 


ATTAAGTTTG 


22000 


ATAGCATTTG 


CCTTTACGCC 


ACCTTCTTCC 


CCATGGCCCA 


CAACACCGCC 


22050 


TCCACGCTTG 


AGGCCATGCT 


TAGAAACGAC 


ACCAACGACC 


AGT CCTTTAA 


22100 


CGACTATCTC 


TCCGCCGCCA 










CCAACGTGCC 


CATATCCATC 


CCCTCCCGCA 


ACTGGGCGGC 


TTTCCGCGGC 


22200 


TGGGCCTTCA 


CGCGCCTTAA 


GACTAAGGAA 


ACCCCATCAC 


TGGGCTCGGG 


22250 


CTACGACCCT 


TATTACACCT 


ACTCTGGCTC 


TATACCCTAC 


CTAGATGGAA 


22300 


CCTTTTACCT 


CAACCACACC 


TTTAAGAAGG 


TGGCCATTAC 


CTTTGACTCT 


22350 


TCTGTCAGCT 


GGCCTGGCAA 


TGACCGCCTG 


CTTACCCCCA 


ACGAGTTTGA 


22400 
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AATTAAGCGC TCAGTTGACG GGGAGGGTTA 
TGACCAAAGA CTGGTTCCTG GTACAAATGC 
TACCAGGGCT TCTATATCCC AGAGAGCTAC 
CTTTAGAAAC TTCCAGCCCA TGAGCCGTCA 
ACAAGGACTA CCAACAGGTG GGCATCCTAC 
TTTGTTGGCT ACCTTGCCCC CACCATGCGC 
TAACTTCCCC TATCCGCTTA TAGGCAAGAC 
AGAAAAAGTT TCTTTGCGAT CGCACCCTTT 
AACTTTATGT CCATGGGCGC ACTCACAGAC 
CGCCAACTCC GCCCACGCGC TAGACATGAC 
ACGAGCCCAC CCTTCTTTAT GTTTTGTTTG 
GTGCACCGGC CGCACCGCGG CGTCATCGAA 
CTTCTCGGCC GGCAACGCCA CAACATAAAG 
AGCTGCCGCC ATGGGCTCCA GTGAGCAGGA 
ATCTTGGTTG TGGGCCATAT TTTTTGGGCA 
GGCTTTGTTT CTCCACACAA GCTCGCCTGC 
T CG CG AG ACT GGGGGCGTAC ACTGGATGGC 
CAAAAACATG CTACCTCTTT GAGCCCTTTG 
AAGCAGGTTT ACCAGTTTGA GTACGAGTCA 
TGCTTCTTCC CCCGACCGCT GTATAACGCT 
TACAGGGGCC CAACTCGGCC GCCTGTGGAC 
CACGCCTTTG CCAACTGGCC CCAAACTCCC 
GAACCTTATT ACCGGGGTAC CCAACTCCAT 
AGCCCACCCT GCGTCGCAAC CAGGAACAGC 
CACTCGCCCT ACTTCCGCAG CCACAGTGCG 
TTTTTGTCAC TTGAAAAACA TGTAAAAATA 



CAACGTTGCC CAGTGTAACA 22450 

TAGCTAACTA CAACATTGGC 22500 

AAGGACCGCA TGTACTCCTT 22550 

GGTGGTGGAT GATACTAAAT 22600 

ACCAACACAA CAACTCTGGA 22 650 

GAAGGACAGG CCTACCCTGC 22700 

CGCAGTTGAC AGCATTACCC 2 2750 

GGCGCATCCC ATTCTCCAGT 228 00 

CTGGGCCAAA ACCTTCTCTA 22850 

TTTTGAGGTG GATCCCATGG 22900 

AAGTCTTTGA CGTGGTCCGT 22950 

ACCGTGTACC TGGGCACGCC 23 000 

AAGCAAGCAA CATCAACAAC 2 3 050 

ACTGAAAGCC ATTGTCAAAG 2 3100 

CCTATGACAA GCGCTTTCCA 23150 

GCCATAGTCA ATACGGCCGG 2 3200 

CTTTGCCTGG AACCCGCACT 2 3250 

GCTTTTCTGA CCAGCGACTC 23 3 00 

CTCCTGCGCC GTAGCGCCAT 2 3350 

GGAAAAGTCC ACCCAAAGCG 2 34 00 

TATTCTGCTG CATGTTTCTC 23450 

ATGGATCACA ACCCCACCAT 2 3 500 

GCTCAACAGT CCCCAGGTAC 2 3 550 

TCTACAGCTT CCTGGAGCGC 2 3 600 

CAGATTAGGA GCGCCACTTC 2 3 650 

ATGTACTAGA GACACTTTCA 23700 
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ATAAAGGCAA ATGCTTTTAT TTGTACACTC 
ACCCTTGCCG TCTGCGCCGT TTAAAAATCA 
GCTATGCGCC ACTGGCAGGG ACACGTTGCG 
ACTTAAACTC AGGCACAACC ATCCGCGGCA 
CACAGGCTGC GCACCATCAC CAACGCGTTT 
CTTGAAGTCG CAGTTGGGGC CTCCGCCCTG 
CAGGGTTGCA GCACTGGAAC ACTATCAGCG 
AGCACGCTCT TGTCGGAGAT CAGATCCGCG 
CAGGGCGAAC GGAGTCAACT TTGGTAGCTG 
GCCCAGGCTT TGAGTTGCAC TCGCACCGTA 
TGCCCGGTCT GGGCGTTAGG ATACAGCGCC 
CTTAAAAGCC ACCTGAGCCT TTGCGCCTTC 
ACTTGCCGGA AAACTGATTG GCCGGACAGG 
CTTGCGTCGG TGTTGGAGAT CTGCACCACA 
CACGATCTTG GCCTTGCTAG ACTGCTCCTT 
CGCTCGTCAC ATCCATTTCA ATCACGTGCT 
CCGTGTAGAC ACTTAAGCTC GCCTTCGATC 
CAACGCGCAG CCCGTGGGCT CGTGATGCTT 
ACTGCAGGTA CGCCTGCAGG AATCGCCCCA 
TTGCTGGTGA AGGTCAGCTG CAACCCGCGG 
CTTGCATACG GCCGCCAGAG CTTCCACTTG 
TCGCCTTTAG ATCGTTATCC ACGTGGTACT 
GCCTCCATGC CCTTCTCCCA CGCAGACACG 
CATCACCGTA ATTTCACTTT CCGCTTCGCT 
GCGTCCGCAT ACCACGCGCC ACTGGGTCGT 
GTGCGCTTAC CTCCTTTGCC ATGCTTGATT 



TCGGGTGATT ATTTACCCCC 23750 

AAGGGGTTCT GCCGCGCATC 2 3800 

ATACTGGTGT TTAGTGCTCC 23850 

GCTCGGTGAA GTTTTCACTC 2 3900 

AGCAGGTCGG GCGCCGATAT 23 950 

CGCGCGCGAG TTGCGATACA 24000 

CCGGGTGGTG CACGCTGGCC 24050 

TCCAGGTCCT CCGCGTTGCT 24100 

CCTTCCCAAA AAGGGCGCGT 24150 

GTGGCATCAA AAGGTGACCG 242 00 

TGCATAAAAG CCTTGATCTG 24250 

AGAGAAGAAC ATGCCGCAAG 24 3 00 

CCGCGT CGTG CACGCAGCAC 24 3 50 

TTTCGGCCCC ACCGGTTCTT 24400 

CAGCGCGCGC TGCCCGTTTT 24450 

CCTTATTTAT CATAATGCTT 24500 

TCAGCGCAGC GGTGCAGCCA 24550 

GTAGGTCACC TCTGCAAACG 24600 

TCATCGTCAC AAAGGTCTTG 24650 

TGCTCCTCGT TCAGCCAGGT 2 47 00 

GTCAGGCAGT AGTTTGAAGT 24750 

TGTCCATCAG CGCGCGCGCA 2 4800 

ATCGGCACAC TCAGCGGGTT 24850 

GGGCTCTTCC TCTTCCTCTT 24900 

CTTCATTCAG CCGCCGCACT 24950 

AGCACCGGTG GGTTGCTGAA 25000 
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ACCCACCATT TGTAGCGCCA CATCTTCTCT TTCTTCCTCG CTGTCCACGA 25050 

TTACCTCTGG TGATGGCGGG CGCTCGGGCT TGGGAGAAGG GCGCTTCTTT 25100 

TTCTTCTTGG GCGCAATGGC CAAATCCGCC GCCGAGGTCG ATGGCCGCGG 25150 

GCTGGGTGTG CGCGGCACCA GCGCGTCTTG TGATGAGTCT TCCTCGTCCT 25200 

CGGACTCGAT ACGCCGCCTC ATCCGCTTTT TTGGGGGCGC CCGGGGAGGC 25250 

GGCGGCGACG GGGACGGGGA CGACACGTCC TCCATGGTTG GGGGACGTCG 253 00 

CGCCGCACCG CGTCCGCGCT CGGGGGTGGT TTCGCGCTGC TCCTCTTCCC 25350 

GACTGGCCAT TTCCTTCTCC TATAGGCAGA AAAAG AT CAT GGAGTCAGTC 25400 

GAGAAGAAGG ACAGCCTAAC CGCCCCCTCT GAGTTCGCCA CCACCGCCTC 25450 

CACCGATGCC GCCAACGCGC CTACCACCTT CCCCGTCGAG GCACCCCCGC 25500 

TTGAGGAGGA GGAAGTGATT ATCGAGCAGG ACCCAGGTTT TGTAAGCGAA 25550 : 

GACGACGAGG ACCGCTCAGT ACCAACAGAG GATAAAAAGC AAGACCAGGA 25600; : 

CAACGCAGAG GCAAACGAGG AACAAGTCGG GCGGGGGGAC GAAAGGCATG 2 5650 

GCGACTACCT AGATGTGGGA GACGACGTGC TGTTGAAGCA TCTGCAGCGC 25700 

CAGTGCGCCA TTATCTGCGA CGCGTTGCAA GAGCGCAGCG ATGTGCCCCT 25750 

CGCCATAGCG GATGTCAGCC TTGCCTACGA ACGCCACCTA TTCTCACCGC 25800 

GCGTACCCCC CAAACGCCAA GAAAACGGCA CATGCGAGCC CAACCCGCGC 2 5850 

CTCAACTTCT ACCCCGTATT TGCCGTGCCA GAGGTGCTTG CCACCTATCA 2 5900 

CATCTTTTTC CAAAACTGCA AGATACCCCT ATCCTGCCGT GCCAACCGCA 2 5950 

GCCGAGCGGA CAAGCAGCTG GCCTTGCGGC AGGGCGCTGT CATACCTGAT 2 6000 

ATCGCCTCGC TCAACGAAGT GCCAAAAATC TTTGAGGGTC TTGGACGCGA 26050 

CGAGAAGCGC GCGGCAAACG CTCTGCAACA GGAAAACAGC GAAAATGAAA 2 6100 

GTCACTCTGG AGTGTTGGTG GAACTCGAGG GTGACAACGC GCGCCTAGCC 2 6150 

GTACTAAAAC GCAGCATCGA GGTCACCCAC TTTGCCTACC CGGCACTTAA 2 6200 

CCTACCCCCC AAGGTCATGA GCACAGTCAT GAGTGAGCTG ATCGTGCGCC 26250 

GTGCGCAGCC CCTGGAGAGG GATGCAAATT TGCAAGAACA AACAGAGGAG 26300 
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GGCCTACCCG 


CAGTTGGCGA 


CGAGCAGCTA 


GCGCGCTGGC 


TTCAAACGCG 


26350 


CGAGCCTGCC 


GACTTGGAGG 


AGCGACGCAA 


ACTAATGATG 


GCCGCAGTGC 


26400 


TCGTTACCGT 


GGAGCTTGAG 


TGCATGCAGC 


GGTTCTTTGC 


TGACCCGGAG 


26450 


ATGCAGCGCA 


AGCTAGAGGA 


AACATTGCAC 


TACACCTTTC 


GACAGGGCTA 


26500 


CGTACGCCAG 


GCCTGCAAGA 


TCTCCAACGT 


GGAGCTCTGC 


AACCTGGTCT 


26550 


CCTACCTTGG 


AATTTTGGAC 


GAAAACCGCC 


TTGGGCAAAA 


CGTGCTTCAT 


26600 


TCCACGCTCA 


AGGGCGAGGC 


GCGCCGCGAC 


TACGTCCGCG 


ACTGCGTTTA 


26650 


CTTATTTCTA 


TGCTACACCT 


GGCAGACGGC 


CATGGGCGTT 


TGGCAGCAGT 


26700 


GCTTGGAGGA 


GTGCAACCTC 


AAGGAGCTGC 


AGAAACTGCT 


AAAGCAAAAC 


26750 


TTGAAGGACC 


TATGGACGGC 


CTTCAACGAG 


CGCTCCGTGG 


CCGCGCACCT 


26800 


GGCGGACATC 


ATTTTCCCCG 


AACGCCTGCT 


TAAAACCCTG 


CAACAGGGTC 


26850 


TGCCAGACTT 


CACCAGTCAA 


AGCATGTTGC 


AGAACTTTAG 


GAACTTTATC 


26900 


CTAGAGCGCT 


CAGGAATCTT 


GCCCGCCACC 


TGCTGTGCAC 


TTCCTAGCGA 


26950 


CTTTGTGCCC 


ATTAAGTACC 


GCGAATGCCC 


TCCGCCGCTT 


TGGGGCCACT 


27000 


GCTACCTTCT 


GCAGCTAGCC 


AACTACCTTG 


CCTACCACTC 


TGACATAATG 


27050 


GAAGACGTGA 


GCGGTGACGG 


TCTACTGGAG 


TGTCACTGTC 


GCTGCAACCT 


27100 


ATGCACCCCG 


CACCGCTCCC 


TGGTTTGCAA 


TTCGCAGCTG 


CTTAACGAAA 


27150 


GTCAAATTAT 


CGGTACCTTT 


GAGCTGCAGG 


GTCCCTCGCC 


TGACGAAAAG 


27200 


TCCGCGGCTC 


CGGGGTTGAA 


ACTCACTCCG 


GGGCTGTGGA 


CGTCGGCTTA 


27250 


CCTTCGCAAA 


TTTGTACCTG 


AGGACTACCA 


CGCCCACGAG 


ATTAGGTTCT 


27300 


ACGAAGACCA 


ATCCCGCCCG 


CCAAATGCGG 


AGCTTACCGC 


CTGCGTCATT 


27350 


ACCCAGGGCC 


ACATTCTTGG 


CCAATTGCAA 


GCCATCAACA 


AAGCCCGCCA 


27400 


AGAGTTTCTG 


CTACGAAAGG 


GACGGGGGGT 


TTACTTGGAC 


CCCCAGTCCG 


27450 


GCGAGGAGCT 


CAACCCAATC 


CCCCCGCCGC 


CGCAGCCCTA 


TCAGCAGCAG 


27500 


CCGCGGGCCC 


TTGCTTCCCA 


GGATGGCACC 


CAAAAAGAAG 


CTGCAGCTGC 


27550 


CGCCGCCACC 


CACGGACGAG 


GAGGAATACT 


GGGACAGTCA 


GGCAGAGGAG 


27600 
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GTTTTGGACG AGGAGGAGGA GGACATGATG GAAGACTGGG AGAGCCTAGA 27650 
CGAGGAAGCT TCCGAGGTCG AAGAGGTGTC AGACGAAACA CCGTCACCCT 27700 
CGGTCGCATT CCCCTCGCCG GCGCCCCAGA AATCGGCAAC CGGTTCCAGC 27750 
ATGGCTACAA CCTCCGCTCC TCAGGCGCCG CCGGCACTGC CCGTTCGCCG 27800 
ACCCAACCGT AGATGGGACA CCACTGGAAC CAGGGCCGGT AAGTCCAAGC 27850 
AGCCGCCGCC GTTAGCCCAA GAGCAACAAC AGCGCCAAGG CTACCGCTCA 27900 
TGGCGCGGGC ACAAGAACGC CATAGTTGCT TGCTTGCAAG ACTGTGGGGG 27950 
CAACATCICC TTCGCCCGCC GCTTTCTTCT CTACCATCAC GGCGTGGCCT 28000 
TCCCCCGTAA CATCCTGCAT TACTACCGTC ATCTCTACAG CCCATACTGC 28050 
ACCGGCGGCA GCGGCAGCGG CAGCAACAGC AGCGGCCACA CAGAAGCAAA 28100 
GGCGACCGGA TAGCAAGACT CTGACAAAGC CCAAGAAATC CACAGCGGCG 28150 
GCAGCAGCAG GAGGAGGAGC GCTGCGTCTG GCGCCCAACG AACCCGTATC 28200 
GACCCGCGAG CTTAGAAACA GGATTTTTCC CACTCTGTAT GCTATATTTC 28250 
AACAGAGCAG GGGCCAAGAA CAAGAGCTGA AAATAAAAAA CAGGTCTCTG 283 00 
CGATCCCTCA CCCGCAGCTG CCTGTATCAC AAAAGCGAAG ATCAGCTTCG 28350 
GCGCACGCTG GAAGACGCGG AGGCTCTCTT CAGTAAATAC TGCGCGCTGA 28400 
CTCTTAAGGA CTAGTTTCGC GCCCTTTCTC AAATTTAAGC GCGAAAACTA 28450 
CGTCATCTCC AGCGGCCACA CCCGGCGCCA GCACCTGTCG TCAGCGCCAT 28500 
TATGAGCAAG GAAATTCCCA CGCCCTACAT GTGGAGTTAC CAGCCACAAA 28550 
TGGGACTTGC GGCTGGAGCT GCCCAAGACT ACTCAACCCG AATAAACTAC 28600 
ATGAGCGCGG GACCCCACAT GATATCCCGG GTCAACGGAA TCCGCGCCCA 28650 
CCGAAACCGA ATTCTCTTGG AACAGGCGGC TATTACCACC ACACCTCGTA 28700 
ATAACCTTAA TCCCCGTAGT TGGCCCGCTG CCCTGGTGTA CCAGGAAAGT 28750 
CCCGCTCCCA CCACTGTGGT ACTTCCCAGA GACGCCCAGG CCGAAGTTCA 28800 
GATGACTAAC TCAGGGGCGC AGCTTGCGGG CGGCTTTCGT CACAGGGTGC 28850 
GGTCGCCCGG GCAGGGTATA ACTCACCTGA CAATCAGAGG GCGAGGTATT 28900 
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CAGCTCAACG ACGAGTCGGT GAGCTCCTCG CTTGGTCTCC GTCCGGACGG 28950 

GACATTTCAG ATCGGCGGCG CCGGCCGTCC TTCATTCACG CCTCGTCAGG 29000 

CAATCCTAAC TCTGCAGACC TCGTCCTCTG AGCCGCGCTC TGGAGGCATT 29050 

GGAACTCTGC AATTTATTGA GGAGTTTGTG CCATCGGTCT ACTTTAACCC 29100 

CTTCTCGGGA CCTCCCGGCC ACTAT CCGGA TCAATTTATT CCTAACTTTG 29150 

ACGCGGTAAA GGACTCGGCG GACGGCTACG ACTGAATGTT AAGTGGAGAG 29200 

GCAGAGCAAC TGCGCCTGAA ACACCTGGTC CACTGTCGCC GCCACAAGTG 29250 

CTTTGCCCGC GACTCCGGTG AGTTTTGCTA CTTTGAATTG CCCGAGGATC 29300 

ATATCGAGGG CCCGGCGCAC GGCGTCCGGC TTACCGCCCA GGGAGAGCTT 29350 

GCCCGTAGCC TGATTCGGGA GTTTACCCAG CGCCCCCTGC TAGTTGAGCG 29400 

GGACAGGGGA CCCTGTGTTC TCACTGTGAT TTGCAACTGT CCTAACCTTG 29450 

GATTACATCA AGATCTTTGT TGCCATCTCT GTGCTGAGTA TAATAAATAC 29500 

AGAAATTAAA ATATACTGGG GCTCCTATCG CCATCCTGTA AACGCCACCG 29550 

TCTTCACCCG CCCAAGCAAA CCAAGGCGAA CCTTACCTGG TACTTTTAAC 29600 

ATCTCTCCCT CTGTGATTTA CAACAGTTTC AACCCAGACG GAGTGAGTCT 29650 

ACGAGAGAAC CTCTCCGAGC TCAGCTACTC CATCAGAAAA AACACCACCC 29700 

TCCTTACCTG CCGGGAACGT ACGAGTGCGT CACCGGCCGC TGCACCACAC 29750 

CTACCGCCTG ACCGTAAACC AGACTTTTTC CGGACAGACC TCAATAACTC 29800 

TGTTTACCAG AACAGGAGGT GAGCTTAGAA AACCCTTAGG GTATTAGGCC 29850 

AAAGGCGCAG CTACTGTGGG GTTTATGAAC AATTCAAGCA ACTCTACGGG 29900 

CTATTCTAAT TCAGGTTTCT CTAGAATCGG GGTTGGGGTT ATTCTCTGTC 29950 

TTGTGATTCT CTTTATTCTT ATACTAACGC TTCTCTGCCT AAGGCTCGCC 30000 

GCCTGCTGTG TGCACATTTG CATTTATTGT CAGCTTTTTA AACGCTGGGG 30050 

TCGCCACCCA AGATGATTAG GTACATAATC CTAGGTTTAC TCACCCTTGC 30100 

GTCAGCCCAC GGTACCACCC AAAAGGTGGA TTTTAAGGAG CCAGCCTGTA 30150 

ATGTTACATT CGCAGCTGAA GCTAATGAGT GCACCACTCT TATAAAATGC 30200 
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ACCACAGAAC ATGAAAAGCT GCTTATTCGC 
GTATGCTGTT TATGCTATTT GGCAGCCAGG 
TTACAGTTTT CCAGGGTAAA AGTCATAAAA 
TTTTATGAAA TGTG CGAC AT TACCATGTAC 
GTGGCCCCCA CAAAATTGTG TGGAAAACAC 
CTATGCTAAT TACAGTGCTC GCTTTGGTCT 
TACAAAAGCA GACGCAGCTT TATTGAGGAA 
TAAGTTACAA AGCTAATGTC ACCACTAACT 
AACAAATTCA AAAAGTT AG C ATTATAATTA 
GGTCATTTCC TGCTCAATAC CATTCCCCTG 
ATATGCTCCA GCGCTACAAC CTTGAAGTCA 
CTGACTTTGG CCAGCACCTG TCCCGCGGAT 
CGACCCACCC TAACAGAGAT GACCAACACA 
CGGACTTACA TCTACCACAA ATACACCCCA 
ACTGGGATAA CTTGGGCATG TGGTGGTTCT 
TGCCTTATTA TTATGTGGCT CATCTGCTGC 
ACCACCCATC TATAGTCCCA TCATTGTGCT 
TCCATAGATT GGACGGACTG AAACACATGT 
TTAAATGAGA CATGATTCCT CGAGTTTTTA 
CTTTTTTGTG CGTGCTCCAC ATTGGCTGCG 
CTGCATTCCA GCCTTCACAG TCTATTTGCT 
CGCTCATCTG CAGCCTCATC ACTGTGGTCA 
GACTGGGTCT GTGTGCGCTT TGCATATCTC 
GGACAGGACT ATAGCTGAGC TTCTTAGAAT 
TGTGACTTTT CTGCTGATTA TTTGCACCCT 
CCTCCAAGCC TCAAAGACAT ATATCATGCA 



CACAAAAACA 


AAATTGGCAA 


30250 


TGACACTACA 


GAGTATAATG 


30300 


CTTTTATGTA 


TACTTTTCCA 


30350 


ATGAGCAAAC 


AGTATAAGTT 


30400 


TGGCACTTTC 


TGCTGCACTG 


30450 


GTACCCTACT 


CTATATTAAA 


30500 


AAGAAAATGC 


CTTAATTTAC 


30550 


GCTTTACTCG 


CTGCTTGCAA 


30600 


GAATAGGATT 


TAAACCCCCC 


30650 


AACAATTGAC 


TCTATGTGGG 


30700 


GGCTTCCTGG 


ATGTCAGCAT 


30750; 


TTGTTCCAGT 


CCAACTACAG 


30800 


ACCAACGCGG 


CCGCCGCTAC 


30850. 


AGTTTCTGCC 


TTTGTCAATA 


30900^ 


CCATAGCGCT 


TATGTTTGTA 


30950; 


CTAAAGCGCA 


AACGCGCCCG 


31000 


ACACCCAAAC 


AATGATGGAA 


31050- 


TCTTTTCTCT 


TACAGTATGA 


31100 


TATTACTGAC 


CCTTGTTGCG 


31150 


GTTTCTCACA 


TCGAAGTAGA 


31200 




GT C ACC CT C A 


31250 


TCGCCTTTAT 


CCAGTGCATT 


31300 


AGACACCATC 


CCCAGTACAG 


31350 


TCTTTAATTA 


TGAAATTTAC 


31400 


AT CTG CGTTT 


TGTTCCCCGA 


31450 


GATTCACTCG 


TATATGGAAT 


31500 
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ATTCCAAGTT 


GCTACAATGA 


AAAAAGCGAT 


CTTTCCGAAG 


CCTGGTTATA 


31550 


TGCAATCATC 


TCTGTTATGG 


TGTTCTGCAG 


TACCATCTTA 


GCCCTAGCTA 


31600 


TATATCCCTA 


CCTTGACATT 


GGCTGGAAAC 


GAATAGATGC 


CATGAACCAC 


31650 


CCAACTTTCC 


CCGCGCCCGC 


TATGCTTCCA 


CTGCAACAAG 


TTGTTGCCGG 


31700 


CGGCTTTGTC 


CCAGCCAATC 


AGCCTCGCCC 


CACTTCTCCC 


ACCCCCACTG 


31750 


AAATCAGCTA 


CTTTAATCTA 


ACAGGAGGAG 


ATGACTGACA 


CCCTAGATCT 


31800 


AGAAATGGAC 


GGAATTATTA 


CAGAGCAGCG 


CCTGCTAGAA 


AGACGCAGGG 


31850 


CAGCGGCCGA 


GCAACAGCGC 


ATGAATCAAG 


AGCTCCAAGA 


CATGGTTAAC 


31900 


TTGCACCAGT 


GCAAAAGGGG 


TATCTTTTGT 


CTGGTAAAGC 


AGGCCAAAGT 


31950 


CACCTACGAC 


AGTAATACCA 


CCGGACACCG 


CCTTAGCTAC 


AAGTTGCCAA 


32000 


CCAAGCGTCA 


GAAATTGGTG 


GTCATGGTGG 


GAGAAAAGCC 


CATTACCATA 


32050 


ACTCAGCACT 


CGGTAGAAAC 


CGAAGGCTGC 


ATTCACTCAC 


CTTGTCAAGG 


32100 


ACCTGAGGAT 


CTCTGCACCC 


TTATTAAGAC 


CCTGTGCGGT 


CTCAAAGATC 


32150 


TTATTCCCTT 


TAACTAATAA 


AAAAAAATAA 


TAAAGCATCA 


CTTACTTAAA 


32200 


ATGAGTTAGC 


AAATTTCTGT 


CCAGTTTATT 


CAGCAGCACC 


TCCTTGCCCT 


32250 


CCTCCCAGCT 


CTGGTATTGC 


AGCTTCCTCC 


TGGCTGCAAA 


CTTTCTCCAC 


32300 


AATCTAAATG 


GAATGTCAGT 


TTCCTCCTGT 


TCCTGTCCAT 


CCGCACCCAC 


32350 


TATCTTCATG 


TTGTTGCAGA 


TGAAGCGCGC 


AAGACCGTCT 


GAAGATACCT 


32400 


TCAACCCCGT 


GTATCCATAT 


GACACGGAAA 


CCGGTCCTCC 


AACTGTGCCT 


32450 


TTTCTTACTC 


CTCCCTTTGT 


ATCCCCCAAT 


GGGTTTCAAG 


AGAGTCCCCC 


32500 


TGGGGTACTC 


TCTTTGCGCC 


TATCCGAACC 


TCTAGTTACC 


TCCAATGGCA 


3 2550 


TGCTTGCGCT 


CAAAATGGGC 


AACGGCCTCT 


CTCTGGACGA 


GGCCGGCAAC 


32600 


CTTACCTCCC 


AAAATGTAAC 


CACTGTGAGC 


CCACCTCTCA 


AAAAAACCAA 


32650 


GTCAAACATA 


AACCTGGAAA 


TATCTGCACC 


CCTCACAGTT 


ACCTCAGAAG 


32700 


CCCTAACTGT 


GGCTGCCGCC 


GCACCTCTAA 


TGGTCGCGGG 


CAACACACTC 


32750 


ACCATGCAAT 


CACAGGCCCC 


GCTAACCGTG 


CACGACTCCA 


AACTTAGCAT 


32800 
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TGCCACCCAA GGACCCCTCA CAGTGTCAGA 
CATCAGGCCC CCTCACCACC ACCGATAGCA 
TCACCCCCTC TAACTACTGC CACTGGTAGC 
GCCCATTTAT ACACAAAATG GAAAACTAGG 
TGCATGTAAC AGACGACCTA AACACTTTGA 
GTGACTATTA ATAATACTTC CTTGCAAACT 
TTTTGATTCA CAAGGCAATA TGCAACTTAA 
TTGATTCTCA AAACAGACGC CTTATACTTG 
GCTCAAAACC AACTAAATCT AAGACTAGGA 
CTCAGCCCAC AACTTGGATA TTAACTACAA 
CAGCTTCAAA CAATTCCAAA AAGCTTGAGG 
GGGTTGATGT TTGACGCTAC AGCCATAGCC 
TGAATTTGGT TCACCTAATG CACCAAACAC 
TTGGCCATGG CCTAGAATTT GATTCAAACA 
GGAACTGGCC TTAGTTTTGA CAGCACAGGT 
AAATAATGAT AAGCTAACTT TGTGGACCAC 
GTAGACTAAA TGCAGAGAAA GATGCTAAAC 
TGTGGCAGTC AAATACTTGC TACAGTTTCA 
TTTGGCTCCA ATATCTGGAA CAGTTCAAAG 
TTGACGAAAA TGGAGTGCTA CTAAACAATT 
TGGAACTTTA GAAATGGAGA TCTTACTGAA 
TGTTGGATTT ATGCCTAACC TATCAGCTTA 
CTGCCAAAAG TAACATTGTC AGTCAAGTTT 
AAACCTGTAA CACTAACCAT TACACTAAAC 
CACAACTCCA AGTGCATACT CTATGTCATT 
ACAACTACAT TAATGAAATA TTTGCCACAT 



AGGAAAGCTA GCCCTGCAAA 32850 

GTACC CTTAC TATCACTGCC 32900 

TTGGGCATTG ACTTGAAAGA 32950 

ACTAAAGTAC GGGGCTCCTT 33000 

CCGTAGCAAC TGGTCCAGGT 33050 

AAAGTTACTG GAGCCTTGGG 3 3100 

TGTAGCAGGA GGACTAAGGA 3 3150 

ATGTTAGTTA TCCGTTTGAT 33200 

CAGGGCCCTC TTTTTATAAA 33250 

CAAAGGCCTT TACTTGTTTA 33300 

TTAACCTAAG CACTGCCAAG 3 3350 

ATTAATGCAG GAGATGGGCT 3 3400, 

AAATCCCCTC AAAACAAAAA 33450, 

AGGCTATGGT TCCTAAACTA 3 3 500: 

GCCATTACAG TAGGAAACAA 3 3 550 

ACCAGCTCCA TCTCCTAACT 33 600; 

TCACTTTGGT CTTAACAAAA 33 650 

GTTTTGGCTG TTAAAGGCAG 33700 

TGCTCATCTT ATTATAAGAT 33750 

CCTTCCTGGA CCCAGAATAT 3 3800 

GGCACAGCCT ATACAAACGC 3 3 850 

TCCAAAATCT CACGGTAAAA 3 39 00 

ACTTAAACGG AGACAAAACT 33950 

GGTACACAGG AAACAGGAGA 34 000 

TTCATGGGAC TGGTCTGGCC 34050 

CCTCTTACAC TTTTTCATAC 34100 
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ATTGCCCAAG AATAAAGAAT CGTTTGTGTT 
TTCAATTGCA GAAAATTTCA AGTCATTTTT 
CCACCACATA GCTTATACAG ATCACCGTAC 
CCTAGTATTC AACCTGCCAC CTCCCTCCCA 
TTTCTCCCCG GCTGGCCTTA AAAAG CATC A 
TTCTTAGGTG TTATATTCCA CACGGTTTCC 
AGTGATATTA ATAAACTGGC GGCGATATAA 
AAATCAGGCA AAGCCTCGCG CAAAAAAGAA 
ATGCAGATAA AGGCAGGTAA GCTCCGGAAC 
TTTTTCTCTC AAACATGTCT GCGGGTTTCT 
AACAAAAAAA CATTTAAACA TTAGAAGCCT 
ACCCTTATAA GCATAAGACG GACTACGGCC 
AAACTGGTCA CCGTGATTAA AAAGCACCAC 
CCGGAGTCAT AATGTAAGAC TCGGTAAACA 
CAGTGCTAAA AAGCGACCGA AATAGCCCGG 
GTAGAGACAA CATTACAGCC CCCATAGGAG 
GAGAAAAACA CATAAACACC TGAAAAACCC 
ACCCTCCCGC TCCAGAACAA CATACAGCGC 
GTCAGCCTTA CCAGTAAAAA AGAAAACCTA 
ACGGCACCAG CTCAATCAGT CACAGTGTAA 
GAGTATATAT AGGACTAAAA AATGACGTAA 
CACCCAGAAA ACCGCACGCG AACCTACGCC 
CCCACAACTT CCTCAAATCG TCACTTCCGT 
CCCATTTTAA GAAAACTACA ATTCCCAACA 
AAAACCTACG TCACCCGCCC CGTTCCCACG 



ATGTTTCAAC GTGTTTATTT 34150 

CATTCAGTAG TATAGCCCCA 34200 

CTTAATCAAA CTCACAGAAC 34250 

ACACACAGAG TACACAGTCC 34300 

TATCATGGGT AACAGACATA 34350 

TGTCGAGCCA AACGCTCATC 34400 

AATGCAAGGT GCTGCTCAAA 34450 

AGCACATCGT AGTCATGCTC 34500 

CACCACAGAA AAAGACACCA 34550 

GCATAAACAC AAAATAAAAT 34600 

GTCTTACAAC AGGAAAAACA 34650 

ATGCCGGCGT GACCGTAAAA 34700 

CGACAGCTCC TCGGTCATGT 34750 

CATCAGGTTG ATTCATCGGT 34800 

GGGAATACAT ACCCGCAGGC 34850 

GTATAACAAA ATTAATAGGA 34900 

TCCTGCCTAG GCAAAATAGC 34950 

TTCACAGCGG CAGCCTAACA 35000 

TTAAAAAAAC ACCACTCGAC 3 5050 

AAAAGGGCCA AGTGCAGAGC 35100 

CGGTTAAAGT CCACAAAAAA 35150 

CAGAAACGAA AGCCAAAAAA 35200 

TTTCCCACGT TACGTAACTT 35250 

CATACAAGTT ACTCCGCCCT 3 5300 

CCCCGCGCCA CGTCACAAAC 3 5350 
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TCCACCCCCT CATTATCATA TTGGCTTCAA TCCAAAATAA GGTATATTAT 35400 

TGATGATG 35408 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8509 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GCCCAATACG CAAACCGCCT CTCCCCGCGC GTTGGCCGAT TCATTAATGC 50 

AGCTGCGCGC TCGCTCGCTC ACTGAGGCCG CCCGGGCAAA GCCCGGGCGT 100 

CGGGCGACCT TTGGTCGCCC GGCCTCAGTG AGCGAGCGAG CGCGCAGAGA 150 

GGGAGTGGCC AACTCCATCA CTAGGGGTTC CTTGTAGTTA ATGATTAACC 2 00 

CGCCATGCTA CTTATCTACG TAGCCATTCT CTAGCCCCTG CAGGTCGTTA 250 

CATAACTTAC GGTAAATGGC CCGCCTGGCT GACCGCCCAA CGACCCCCGC 3 00 

CCATTGACGT CAATAATGAC GTATGTTCCC ATAGTAACGC CAATAGGGAC 3 50 

TTTCCATTGA CGTCAATGGG TGGAGTATTT ACGGTAAACT GCCCACTTGG 400 

CAGTACATCA AGTGTATCAT ATGCCAAGTA CGCCCCCTAT TGACGTCAAT 4 50 

GACGGTAAAT GGCCCGCCTG GCATTATGCC CAGTACATGA CCTTATGGGA 500 

CTTTCCTACT TGGCAGTACA TCTACGTATT AGTCATCGCT ATTACCATGG 550 

TGATGCGGTT TTGGCAGTAC ATCAATGGGC GTGGATAGCG GTTTGACTCA 600 

CGGGGATTTC CAAGTCTCCA CCCCATTGAC GTCAATGGGA GTTTGTTTTG 650 

GCACCAAAAT CAACGGGACT TTCCAAAATG TCGTAACAAC TCCGCCCCAT 700 

TGACGCAAAT GGGCGGTAGG CGTGTACGGT GGGAGGTCTA TATAAGCAGA 7 50 

GCTCGTTTAG TGAACCGTCA GATCGCCTGG AGACGCCATC CACGCTGTTT 800 

TGACCTCCAT AGAAGACACC GGGACCGATC CAGCCTCCGG ACTCTAGAGG 850 

ATCCGGTACT CGAGGAACTG AAAAACCAGA AAGTTAACTG GTAAGTTTAG 900 
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TCTTTTTGTC 


TTTTATTTCA 


GGTCCCGGAT 


CCGGTGGTGG 


TGCAAATCAA 


950 


AGAACTGCTC 


CTCAGTGGAT 


GTTGCCTTTA 


CTTCTAGGCC 


TGTACGGAAG 


1000 


TGTTACTTCT 


GCTCTAAAAG 


CTGCGGAATT 


GTACCCGCGG 


CCGCAATTCC 


1050 


CGGGGATCGA 


AAGAGCCTGC 


TAAAGCAAAA 


AAGAAGTCAC 


CATGTCGTTT 


1100 


ACTTTGACCA 


ACAAGAACGT 


GATTTTCGTT 


GCCGGTCTGG 


GAGGCATTGG 


1150 


TCTGGACACC 


AGCAAGGAGC 


TGCTCAAGCG 


CGATCCCGTC 


GTTTTACAAC 


1200 


GTCGTGACTG 


GGAAAACCCT 


GGCGTTACCC 


AACTTAATCG 


CCTTGCAGCA 


1250 


CATCCCCCTT 


TCGCCAGCTG 


GCGTAATAGC 


GAAGAGGCCC 


GCACCGATCG 


1300 


CCCTTCCCAA 


CAGTTGCGCA 


GCCTGAATGG 


CGAATGGCGC 


TTTGCCTGGT 


1350 


TTCCGGCACC 


AGAAGCGGTG 


CCGGAAAGCT 


GGCTGGAGTG 


CGATCTTCCT 


1400 


GAGGCCGATA 


CTGTCGTCGT 


CCCCTCAAAC 


TGGCAGATGC 


ACGGTTACGA 


1450 


TGCGCCCATG 


TACACCAACG 


TAACCTATCC 


CATTACGGTC 


AATCCGCCGT 


1500 


TTGTTCCCAC 


GGAGAATCCG 


ACGGGTTGTT 


ACTCGCTCAC 


ATTTAATGTT 


1550 


GATGAAAGCT 


GGCTACAGGA 


AGGCCAGACG 


CGAATTATTT 


TTGATGGCGT 


1600 


TAACTCGGCG 


TTTCATCTGT 


GGTGCAACGG 


GCGCTGGGTC 


GGTTACGGCC 


1650 


AGGACAGTCG 


TTTGCCGTCT 


GAATTTGACC 


TGAGCGCATT 


TTTACGCGCC 


1700 


GGAGAAAACC 


GCCTCGCGGT 


GATGGTGCTG 


CGTTGGAGTG 


ACGGCAGTTA 


1750 


TCTGGAAGAT 


CAGGATATGT 


GGCGGATGAG 


CGGCATTTTC 


CGTGACGTCT 


1800 


CGTTGCTGCA 


TAAACCGACT 


ACACAAATCA 


GCGATTTCCA 


TGTTGCCACT 


1850 


CGCTTTAATG 


ATGATTTCAG 


CCGCGCTGTA 


CTGGAGGCTG 


AAGTTCAGAT 


1900 


GTGCGGCGAG 


TTGCGTGACT 


ACCTACGGGT 


AACAGTTTCT 


TTATGGCAGG 


1950 


GTGAAACGCA 


GGTCGCCAGC 


GGCACCGCGC 


CTTTCGGCGG 


TGAAATTATC 


2000 


GATGAGCGTG 


GTGGTTATGC 


CGATCGCGTC 


ACACTACGTC 


TGAACGTCGA 


2050 


AAACCCGAAA 


CTGTGGAGCG 


CCGAAATCCC 


GAATCTCTAT 


CGTGCGGTGG 


2100 


TTGAACTGCA 


CACCGCCGAC 


GGCACGCTGA 


TTGAAGCAGA 


AGCCTGCGAT 


2150 


GTCGGTTTCC 


GCGAGGTGCG 


GATTGAAAAT 


GGTCTGCTGC 


TGCTGAACGG 


2200 
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CAAGCCGTTG 


CTGATTCGAG 


GCGTTAACCG 


TCACGAGCAT 


CATCCTCTGC 


2250 


ATGGTCAGGT 


CATGGATGAG 


CAGACGATGG 


TGCAGGATAT 


CCTGCTGATG 


2300 


AAGCAGAACA 


ACTTTAACGC 


CGTGCGCTGT 


TCGCATTATC 


CGAACCATCC 


2350 


GCTGTGGTAC 


ACGCTGTGCG 


ACCGCTACGG 


CCTGTATGTG 


GTGGATGAAG 


2400 


CCAATATTGA 


AACCCACGGC 


ATGGTGCCAA 


TGAATCGTCT 


GACCGATGAT 


2450 


CCGCGCTGGC 


TACCGGCGAT 


GAGCGAACGC 


GTAACGCGAA 


TGGTGCAGCG 


2500 


CGATCGTAAT 


CACCCGAGTG 


TGATCATCTG 


GTCGCTGGGG 


AATGAATCAG 


2550 


GCCACGGCGC 


TAATCACGAC 


GCGCTGTATC 


GCTGGATCAA 


ATCTGTCGAT 


2600 


CCTTCCCGCC 


CGGTGCAGTA 


TGAAGGCGGC 


GGAGCCGACA 


CCACGGCCAC 


2650 


CGATATTATT 


TGCCCGATGT 


ACGCGCGCGT 


GGATGAAGAC 


CAGCCCTTCC 


2700 


CGGCTGTGCC 


GAAATGGTCC 


ATCAAAAAAT 


GGCTTTCGCT 


ACCTGGAGAG 


2750 


ACGCGCCCGC 


TGATCCTTTG 


CGAATACGCC 


CACGCGATGG 


GTAACAGTCT 


2800 


TGGCGGTTTC 


GCTAAATACT 


GGCAGGCGTT 


TCGTCAGTAT 


CCCCGTTTAC 


2850 


AGGGCGGCTT 


CGTCTGGGAC 


TGGGTGGATC 


AGTCGCTGAT 


TAAATATGAT 


2900 


GAAAACGGCA 


ACCCGTGGTC 


GGCTTACGGC 


GGTGATTTTG 


GCGATACGCC 


2950 


GAACGATCGC 


CAGTTCTGTA 


TGAACGGTCT 


GGTCTTTGCC 


GACCGCACGC 


3000 


CGCATCCAGC 


GCTGACGGAA 


GCAAAACACC 


AGCAGCAGTT 


TTTCCAGTTC 


3050 


CGTTTATCCG 


GGCAAACCAT 


CGAAGTGACC 


AGCGAATACC 


TGTTCCGTCA 


3100 


TAGCGATAAC 


GAGCTCCTGC 


ACTGGATGGT 


GGCGCTGGAT 


GGTAAGCCGC 


3150 


TGGCAAGCGG 


TGAAGTGCCT 


CTGGATGTCG 


CTCCACAAGG 


TAAACAGTTG 


3200 


nil uaALi i. 


LibiiAtlALL 


/•*»/•"» a r* o r^f^r* 7\ 

bLAGCCGGAG 


AGCGCCGGGC 


AACTCTGGCT 


~y ~t cz r\ 

3 250 


CACAGTACGC 


GTAGTGCAAC 


CGAACGCGAC 


CGCATGGTCA 


GAAGCCGGGC 


3300 


ACATCAGCGC 


CTGGCAGCAG 


TGGCGTCTGG 


CGGAAAACCT 


CAGTGTGACG 


3350 


CTCCCCGCCG 


CGTCCCACGC 


CATCCCGCAT 


CTGACCACCA 


GCGAAATGGA 


3400 


TTTTTGCATC 


GAGCTGGGTA 


ATAAGCGTTG 


GCAATTTAAC 


CGCCAGTCAG 


3450 


GCTTTCTTTC 


ACAGATGTGG 


ATTGGCGATA 


AAAAACAACT 


GCTGACGCCG 


3500 
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CTGCGCGATC AGTTCACCCG TGCACCGCTG 
TGAAGCGACC CGCATTGACC CTAACGCCTG 
CGGGCCATTA CCAGGCCGAA GCAGCGTTGT 
CTTGCTGATG CGGTGCTGAT TACGACCGCT 
GAAAACCTTA TTTATCAGCC GGAAAACCTA 
AAATGGCGAT TACCGTTGAT GTTGAAGTGG 
GCGCGGATTG GCCTGAACTG CCAGCTGGCG 
CTGGCTCGGA TTAGGGCCGC AAGAAAACTA 
CCTGTTTTGA CCGCTGGGAT CTGCCATTGT 
GTCTTCCCGA GCGAAAACGG TCTGCGCTGC 
TGGCCCACAC CAGTGGCGCG GCGACTTCCA 
GTCAACAGCA ACTGATGGAA ACCAGCCATC 
GAAGGCACAT GGCTGAATAT CGACGGTTTC 
CGACTCCTGG AGCCCGTCAG TATCGGCGGA 
GCTACCATTA CCAGTTGGTC TGGTGTCAAA 
CCATGTCTGC CCGTATTTCG CGTAAGGAAA 
AAACACAAAC TTTTGGATGT TCGGTTTATT 
TCATGGGAGC CTACTTCCCG TTTTTCCCGA 
CATATCAGCA AAAGTGATAC GGGTATTATT 
CTCGCTATTA TTCCAACCGC TGTTTGGTCT 
TCGACTCTAG GCGGCCGCGG GGATCCAGAC 
GTTTGGACAA ACCACAACTA GAATGCAGTG 
AAATTTGTGA TGCTATTGCT TTATTTGTAA 
CAAGTTAACA ACAACAATTG CATTCATTTT 
GGTGTGGGAG GTTTTTTCGG ATCCTCTAGA 
ATGGCTACGT AGATAAGTAG CATGGCGGGT 



GATAACGACA TTGGCGTAAG 3550 

GGTCGAACGC TGGAAGGCGG 3600 

TGCAGTGCAC GGCAGATACA 3 650 

CACGCGTGGC AGCATCAGGG 3700 

CCGGATTGAT GGTAGTGGTC 3750 

CGAGCGATAC ACCGCATCCG 3800 

CAGGTAGCAG AGCGGGTAAA 3850 

TCCCGACCGC CTTACTGCCG 3900 

CAGACATGTA TACCCCGTAC 3950 

GGGACGCGCG AATTGAATTA 4000 

GTTCAACATC AGCCGCTACA 4050 

GCCATCTGCT GCACGCGGAA 4100 

CATATGGGGA TTGGTGGCGA 4150 

ATTACAGCTG AGCGCCGGTC 4200 

AATAATAATA ACCGGGCAGG 4250 

TCCATTATGT ACTATTTAAA 4300 

CTTTTTCTTT TACTTTTTTA 4350 

TTTGGCTACA TGACATCAAC 4400 

TTTGCCGCTA TTTCTCTGTT 4450 

GCTTTCTGAC AAACTCGGCC 4500 

ATGATAAGAT ACATTGATGA 4550 

AAAAAAATGC TTTATTTGTG 4600 

CCATTATAAG CTGCAATAAA 4650 

ATGTTTCAGG TTCAGGGGGA 4700 

GTCGACCTGC AGGGGCTAGA 4750 

TAATCATTAA CTACAAGGAA 4800 
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CCCCTAGTGA TGGAGTTGGC CACTCCCTCT 
TGAGGCCGGG CGACCAAAGG TCGCCCGACG 
CCTCAGTGAG CGAGCGAGCG CGCAGCTGGC 
ACCGATCGCC CTTCCCAACA GTTGCGCAGC 
CAGACGATTG AGCGTCAAAA TGTAGGTATT 
TGCAATGGCT GGCGGTAATA TTGTTCTGGA 
GTTTGAGTTC TTCTACTCAG GCAAGTGATG 
ATTGCGACAA CGGTTAATTT GCGTGATGGA 
CCTCACTGAT TATAAAAACA CTTCTCAGGA 
CTAAAATCCC TTTAATCGGC CTCCTGTTTA 
GAGGAAAGCA CGTTATACGT GCTCGTCAAA 
GTAGCGGCGC ATTAAGCGCG GCGGGTGTGG 
GCTACACTTG CCAGCGCCCT AGCGCCCGCT 
CTTTCTCGCC ACGTTCGCCG GCTTTCCCCG 
TCCCTTTAGG GTTCCGATTT AGTGCTTTAC 
CTTGATTAGG GTGATGGTTC ACGTAGTGGG 
TTTTCGCCCT TTGACGTTGG AGTCCACGTT 
TCCAAACTGG AAC AAC ACT C AACCCTATCT 
TAAGGGATTT TGCCGATTTC GGCCTATTGG 
ACAAAAATTT AACGCGAATT TTAACAAAAT 
TATTTGCTTA TACAATCTTC CTGTTTTTGG 
GGGGTACATA TGATTGACAT GCTAGTTTTA 
TCTTGTTTGC TCCAGACTCT CAGGCAATGA 
CCTCTCAAAA ATAGCTACCC TCTC CGG CAT 
TTGAATATCA TATTGATGGT GATTTGACTG 
TTTGAATCTT TACCTACACA TTACTCAGGC 



CTGCGCGCTC GCTCGCTCAC 4 850 

CCCGGGCTTT GCCCGGGCGG 4900 

GTAATAGCGA AGAGGCCCGC 4950 

CTGAATGGCG AATGGAATTC 5000 

TCCATGAGCG TTTTTCCTGT 5050 

TATTACCAGC AAGGCCGATA 5100 

TTATTACTAA TCAAAGAAGT 5150 

CAGACTCTTT TACTCGGTGG 5200 

TTCTGGCGTA CCGTTCCTGT 5250 

GCTCCCGCTC TGATTCTAAC 5300 

GCAACCATAG TACGCGCCCT 5350 

TGGTTACGCG CAGCGTGACC 5400 

CCTTTCGCTT TCTTCCCTTC 5450 

TCAAGCTCTA AATCGGGGGC 5500 

GGCACCTCGA CCCCAAAAAA 5550 

CCATCGCCCT GATAGACGGT 5600 

CTTTAATAGT GGACTCTTGT 5650, 

CGGTCTATTC TTTTGATTTA 5700 

TTAAAAAATG AGCTGATTTA 5750 

ATTAACGTTT ACAATTTAAA 5800 
GGCTTTTCTG ATT AT C AAC C . 5850 

CGATTACCGT TCATCGATTC 5900 

CCTGATAGCC TTTGTAGAGA 5950 

GAATTTATCA G CTAGAACGG 6000 

TCTCCGGCCT TTCTCACCCG 6050 

ATTGCATTTA AAATATATGA 6100 
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GGGTTCTAAA 


AATTTTTATC 


CTTGCGTTGA 


AATAAAGGCT 


TCTCCCGCAA 


6150 


AAGTATTACA 


GGGTCATAAT 


GTTTTTGGTA 


CAACCGATTT 


AGCTTTATGC 


6200 


TCTGAGGCTT 


TATTGCTTAA 


TTTTGCTAAT 


TCTTTGCCTT 


GCCTGTATGA 


6250 


TTTATTGGAT 


GTTGGAATTC 


CTGATGCGGT 


ATTTTCTCCT 


TACGCATCTG 


6300 


TGCGGTATTT 


CACACCGCAT 


ATGGTGCACT 


CTCAGTACAA 


TCTGCTCTGA 


6350 


TGCCGCATAG 


TTAAGCCAGC 


CCCGACACCC 


GCCAACACCC 


GCTGACGCGC 


6400 


CCTGACGGGC 


TTGTCTGCTC 


CCGGCATCCG 


CTTACAGACA 


AGCTGTGACC 


6450 


GTCTCCGGGA 


GCTGCATGTG 


TCAGAGGTTT 


TCACCGTCAT 


CACCGAAACG 


6500 


CGCGAGACGA 


AAGGGCCTCG 


TGATACGCCT 


ATTTTTATAG 


GTTAATGTCA 


6550 


TGATAATAAT 


GGTTTCTTAG 


ACGTCAGGTG 


GCACTTTTCG 


GGGAAATGTG 


6600 


CGCGGAACCC 


CTATTTGTTT 


ATTTTTCTAA 


ATACATTCAA 


ATATGTATCC 


6650 


GCTCATGAGA 


CAATAACCCT 


GATAAATGCT 


TCAATAATAT 


TGAAAAAGGA 


6700 


AGAGTATGAG 


TATTCAACAT 


TTCCGTGTCG 


CCCTTATTCC 


CTTTTTTGCG 


6750 


GCATTTTGCC 


TTCCTGTTTT 


TGCTCACCCA 


GAAACGCTGG 


TGAAAGTAAA 


6800 


AGATGCTGAA 


GATCAGTTGG 


GTGCACGAGT 


GGGTTACATC 


GAACTGGATC 


6850 


TCAACAGCGG 


TAAGATCCTT 


GAGAGTTTTC 


GCCCCGAAGA 


ACGTTTTCCA 


6900 


ATGATGAGCA 


CTTTTAAAGT 


TCTGCTATGT 


GGCGCGGTAT 


TATCCCGTAT 


6950 


TGACGCCGGG 


CAAGAGCAAC 


TCGGTCGCCG 


CATACACTAT 


TCTCAGAATG 


7000 


ACTTGGTTGA 


GTACTCACCA 


GTCACAGAAA 


AGCATCTTAC 


GGATGGCATG 


7050 


ACAGTAAGAG 


AATTATGCAG 


TGCTGCCATA 


ACCATGAGTG 


ATAACACTGC 


7100 


GGCCAACTTA 


CTTCTGACAA 


CGATCGGAGG 


ACCGAAGGAG 


CTAACCGCTT 


7150 


TTTTGCACAA 


CATGGGGGAT 


CATGTAACTC 


GCCTTGATCG 


TTGGGAACCG 


7200 


GAGCTGAATG 


AAGCCATACC 


AAACGACGAG 


CGTGACACCA 


CGATGCCTGT 


7250 


AGCAATGGCA 


ACAACGTTGC 


GGAAACTATT 


AACTGGCGAA 


CTACTTACTC 


7300 


TAGCTTCCCG 


GCAACAATTA 


ATAGACTGGA 


TGGAGGCGGA 


TAAAGTTGCA 


7350 


GGACCACTTC 


TGCGCTCGGC 


CCTTCCGGCT 


GGCTGGTTTA 


TTGCTGATAA 


7400 
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ATCTGGAGCC GGTGAGCGTG GGTCTCGCGG 
CAGATGGTAA GCCCTCCCGT ATCGTAGTTA 
GCAACTATGG ATGAACGAAA TAGACAGATC 
GATTAAGCAT TGGTAACTGT CAGACCAAGT 
TTGATTTAAA ACTTCATTTT TAATTTAAAA 
TTTGATAATC TCATGACCAA AATCCCTTAA 
AGCGTCAGAC CCCGTAGAAA AGATCAAAGG 
TTCTGCGCGT AATCTGCTGC TTGCAAAC AA 
GTGGTTTGTT TGCCGGATCA AGAGCTACCA 
TGGCTTCAGC AGAGCGCAGA TACCAAATAC 
AGTTAGGCCA CCACTTCAAG AACTCTGTAG 
CTGCTAATCC TGTTACCAGT GGCTGCTGCC 
TACCGGGTTG GACTCAAGAC GATAGTTACC 
GCTGAACGGG GGGTTCGTGC ACACAGCCCA 
ACCGAACTGA GATACCTACA GCGTGAGCTA 
CGAAGGGAGA AAGGCGGACA GGTATCCGGT 
GAGAGCG CAC GAGGGAGCTT CCAGGGGGAA 
CCTGTCGGGT TTCGCCACCT CTGACTTGAG 
GTCAGGGGGG CGGAGCCTAT GGAAAAACGC 
GGTTCCTGGC CTTTTGCTGG CCTTTTGCTC 
TCCCCTGATT CTGTGGATAA CCGTATTACC 
CGCTCGCCGC AGCCGAACGA CCGAGCGCAG 
CGGAAGAGC 
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TATCATTGCA GCACTGGGGC 7450 

TCTACACGAC GGGGAGTCAG 7500 

GCTGAGATAG GTGCCTCACT 7550 

TTACTCATAT ATACTTTAGA 7600 

GGATCTAGGT GAAGATCCTT 7650 

CGTGAGTTTT CGTTCCACTG 7700 

ATCTTCTTGA GATCCTTTTT 7750 

AAAAACCACC GCTACCAGCG 7800 

ACTCTTTTTC CGAAGGTAAC 7850 

TGTCCTTCTA GTGTAGCCGT 7900 

CACCGCCTAC ATACCTCGCT 7950* 

AGTGGCGATA AGTCGTGTCT 8 000 

GGATAAGGCG CAGCGGTCGG 8050 

GCTTGGAGCG AACGACCTAC 8100 

TGAGAAAGCG CCACGCTTCC 8150; 

AAGCGGCAGG GTCGGAACAG 82 00 

ACGCCTGGTA TCTTTATAGT 8250 

CGTCGATTTT TGTGATGCTC 8300 

CAGCAACGCG GCCTTTTTAC 8350 

ACATGTTCTT TCCTGCGTTA 8400 

GCCTTTGAGT GAGCTGATAC 8450 

CGAGTCAGTG AGCGAGGAAG 8500 

8509 
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(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8299 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



GCCCAATACG 


CAAACCGCCT 


CTCCCCGCGC 


GTTGGCCGAT 


TCATTAATGC 


50 


AGCTGCGCGC 


TCGCTCGCTC 


ACTGAGGCCG 


CCCGGGCAAA 


GCCCGGGCGT 


100 


CGGGCGACCT 


TTGGTCGCCC 


GGCCTCAGTG 


AGCGAGCGAG 


CGCGCAGAGA 


150 


GGGAGTGGCC 


AACTCCATCA 


CTAGGGGTTC 


CTTGTAGTTA 


ATGATTAACC 


200 


CGCCATGCTA 


CTTATCTACA 


TCATCGATGA 


ATTCGAGCTT 


GCATGCCTGC 


250 


AGGTCGTTAC 


ATAACTTACG 


GTAAATGGCC 


CGCCTGGCTG 


ACCGCCCAAC 


300 


GACCCCCGCC 


CATTGACGTC 


AATAATGACG 


TATGTTCCCA 


TAGTAACGCC 


350 


AATAGGGACT 


TTCCATTGAC 


GTCAATGGGT 


GGAGTATTTA 


CGGTAAACTG 


400 


CCCACTTGGC 


AGTACATCAA 


GTGTATCATA 


TGCCAAGTAC 


GCCCCCTATT 


450 


GACGTCAATG 


ACGGTAAATG 


GCCCGCCTGG 


CATTATGCCC 


AGTACATGAC 


500 


CTTATGGGAC 


TTTCCTACTT 


GGCAGTACAT 


CTACGTATTA 


GTCATCGCTA 


550 


TTACCATGGT 


GATGCGGTTT 


TGGCAGTACA 


TCAATGGGCG 


TGGATAGCGG 


600 


TTTGACTCAC 


GGGGATTTCC 


AAGTCTCCAC 


CCCATTGACG 


TCAATGGGAG 


650 


TTTGTTTTGG 


CACCAAAATC 


AACGGGACTT 


TCCAAAATGT 


CGTAACAACT 


700 


CCGCCCCATT 


GACGCAAATG 


GGCGGTAGGC 


GTGTACGGTG 


GGAGGTCTAT 


750 


ATAAGCAGAG 


CTCGTTTAGT 


GAACCGTCAG 


ATCGCCTGGA 


GACGCCATCC 


800 


ACGCTGTTTT 


GACCTCCATA 


GAAGACACCG 


GGACCGATCC 


AGCCTCCGGA 


850 


CTCTAGAGGA 


TCCGGTACTC 


GACCCGAGCT 


CGGATCCACT 


AGTAACGGCC 


900 


GCCAGTGTGC 


TGGAATTCTG 


CACTCCAGGC 


TGCCCGGGTT 


TGCATGCTGC 


950 


TGCTGCTGCT 


GCTGCTGGGC 


CTGAGGCTAC 


AGCTCTCCCT 


GGGCATCATC 


1000 
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CTAGTTGAGG AGGAGAACCC GGACTTCTGG 
CCTGGGTGCC GCCAAGAAGC TGCAGCCTGC 
TCATCATCTT CCTGGGCGAT GGGATGGGGG 
AGGATCCTAA AAGGGCAGAA GAAGGACAAA 
GGCCATGGAC CGCTTCCCAT ATGTGGCTCT 
ACAAACATGT GCCAGACAGT GGAGCCACAG 
GTCAAGGGCA ACTTCCAGAC CATTGGCTTG 
CCAGTGCAAC ACGACACGCG GCAACGAGGT 
CCAAGAAAGC AGGGAAGTCA GTGGGAGTGG 
CACGCCTCGC CAGCCGGCAC CTACGCCCAC 
CTCGGACGCC GACGTGCCTG CCTCGGCCCG 
TCGCTACGCA GCTCATCTCC AACATGGACA 
GGCCGAAAGT ACATGTTTCG CATGGGAACC 
TGACTACAGC CAAGGTGGGA CCAGGCTGGA 
AATGGCTCGG CGAACGCCAG GGTGCCCGGT 
CTCATGCAGG CTTCCCTGGA CCCGTCTGTG 
TGAGCCTGGA GACATGAAAT ACGAGATCCA 
CCTCCCTGAT GGAGATGACA GAGGCTGCCC 
CCCCGCGGCT TCTTCCTCTT CGTGGAGGGT 
TCATGAAAGC AGGGCTTACC GGGCACTGAC 
ACGCCATTGA GAGGGCGGGC CAGCTCACCA 
CTCGTCACTG CCGACCACTC CCACGTCTTC 
G CGAGGG AG C TCCTTCATCG GGCTGGCCGC 
AGGCCTACAC GGTCCTCCTA TACGGAAACG 
GACGGCGCCC GGCCGGATGT TACCGAGAGC 
TCGGCAGCAG TCAGCAGTGC CCCTGGACGA 



AACCGCGAGG CAGCCGAGGC 1050 

ACAGACAGCC GCCAAGAACC 1100 

TGTCTACGGT GACAGCTGCC 1150 

CTGGGGCCTG AGATACCCCT 1200 

GTCCAAGACA TACAATGTAG 12 50 

CCACGGCCTA CCTGTGCGGG 13 00 

AGTGCAGCCG CCCGCTTTAA 1350 

CATCTCCGTG ATGAATCGGG 14 00 

TAACCACCAC ACGAGTGCAG 1450 

ACGGTGAACC GCAACTGGTA 1500 

CCAGGAGGGG TGCCAGGACA 1550 

TTGATGTGAT CCTAGGTGGA 1600 

CCAGACCCTG AGTACCCAGA 1650 

CGGGAAGAAT CTGGTGCAGG 1700, 

ACGTGTGGAA CCGCACTGAG 1750 

ACCCATCTCA TGGGTCTCTT 18 00. 

CCGAGACTCC ACACTGGACC 1850^ 

TGCGCCTGCT GAGCAGACAC 1900 

GGTCGCATCG ACCATGGTCA 1950 

TGAGACGATC ATGTTCGACG 2000 

GCGAGGAGGA CACGCTGAGC 2050 

TCCTTCGGAG GCTACCCCCT 2100 

TGGCAAGGCC CGGGACAGGA 2150 

GTCCAGGCTA TGTGCTCAAG 22 00 

GAGAGCGGGA GCCCCGAGTA 2250 

AGAGACCCAC GCAGGCGAGG 2300 
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ACGTGGCGGT 


GTTCGCGCGC 


GGCCCGCAGG 


CGCACCTGGT 


TCACGGCGTG 


2350 


CAGGAGCAGA 


CCTTCATAGC 


GCACGTCATG 


GCCTTCGCCG 


CCTGCCTGGA 


2400 


GCCCTACACC 


GCCTGCGACC 


TGGCGCCCCC 


CGCCGGCACC 


ACCGACGCCG 


2450 


CGCACCCGGG 


GCGGTCCGTG 


GTCCCCGCGT 


TGCTTCCTCT 


GCTGGCCGGG 


2500 


ACCCTGCTGC 


TGCTGGAGAC 


GGCCACTGCT 


CCCTGAGTGT 


CCCGTCCCTG 


2550 


GGGCTCCTGC 


TTCCCCATCC 


CGGAGTTCTC 


CTGCTCCCCA 


CCTCCTGTCG 


2600 


TCCTGCCTGG 


CCTCCAGCCC 


GAGTCGTCAT 


CCCCGGAGTC 


CCTATACAGA 


2650 


GGTCCTGCCA 


TGGAACCTTC 


CCCTCCCCGT 


GCGCTCTGGG 


GACTGAGCCC 


2700 


ATGACACCAA 


ACCTGCCCCT 


TGGCTGCTCT 


CGGACTCCCT 


ACCCCAACCC 


2750 


CAGGGACTGC 


AGGTTGTGCC 


CTGTGGCTGC 


CTGCACCCCA 


GGAAAGGAGG 


2800 


GGGCTCAGGC 


CATCCAGCCA 


CCACCTACAG 


CCCAGTGGGG 


TCGAGACAGA 


2850 


TGGTCAGTCT 


GGAGGATGAC 


GTGGCGTGAA 


GCTGGCCGCG 


GGGATCCAGA 


2900 


CATGATAAGA 


TACATTGATG 


AGTTTGGACA 


AACCACAACT 


AGAATGCAGT 


2950 


GAAAAAAATG 


CTTTATTTGT 


GAAATTTGTG 


ATGCTATTGC 


TTTATTTGTA 


3000 


ACCATTATAA 


GCTGCAATAA 


ACAAGTTAAC 


AACAACAATT 


GCATTCATTT 


3050 


TATGTTTCAG 


GTTCAGGGGG 


AGGTGTGGGA 


GGTTTTTTCG 


GATCCTCTAG 


3100 


AGTCGACTCT 


AGANNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


3150 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


3200 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


3250 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


3300 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


3350 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNGGATCCC 


3400 


CATGACTACG 


TCCGGCGTTC 


CATTTGGCAT 


GACACTACGA 


CCAACACGAT 


3450 


CTCGGTTGTC 


TCGGCGCACT 


CCGTACAGTA 


GGGATCGTCT 


ACCTCCTTTT 


3500 


GAGACAGAAA 


CCCGCGCTAC 


CATACTGGAG 


GATCATCCGC 


TGCTGCCCGA 


3550 


ATGTAACACT 


TTGACAATGC 


ACAACGTGAG 


TTACGTGCGA 


GGTCTTCCCT 


3600 
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GCAGTGTGGG ATTTACGCTG ATTCAGGAAT GGGTTGTTCC CTGGGATATG 3 650 

GTTCTAACGC GGGAGGAGCT TGTAATCCTG AGGAAGTGTA TGCACGTGTG 3700 

CCTGTGTTGT GCCAACATTG ATATCATGAC GAGCATGATG ATCCATGGTT 3 750 

ACGAGTCCTG GGCTCTCCAC TGTCATTGTT CCAGTCCCGG TTCCCTGCAG 3 800 

TGTAT AG CCG GCGGGCAGGT TTTGGCCAGC TGGTTTAGGA TGGTGGTGGA 3850 

TGGCGCCATG TTTAATCAGA GGTTTATATG GTACCGGGAG GTGGTGAATT 3900 

ACAACATGCC AAAAGAGGTA ATGTTTATGT CCAGCGTGTT TATGAGGGGT 3950 

CGCCACTTAA TCTACCTGCG CTTGTGGTAT GATGGCCACG TGGGTTCTGT 4 000 

GGTCCCCGCC ATGAGCTTTG GATACAGCGC CTTGCACTGT GGGATTTTGA 4 050 

ACAATATTGT GGTGCTGTGC TGCAGTTACT GTGCTGATTT AAGTGAGATC 4100 

AGGGTGCGCT GCTGTGCCCG GAGGACAAGG CGCCTTATGC TGCGGGCGGT 4150 

GCGAATCATC GCTGAGGAGA CCACTGCCAT GTTGTATTCC TGCAGGACGG 4200 

AGCGGCGGCG GCAGCAGTTT ATTCGCGCGC TGCTGCAGCA CCACCGCCCT 4250 

ATCCTGATGC ACGATTATGA CTCTACCCCC ATGTAGGGAT CCCCATCACT 43 00. 

AGTGCGGCCG CGGGGATCCA GACATGATAA GATACATTGA TGAGTTTGGA 4 3 50; 

CAAACCACAA CTAGAATGCA GTGAAAAAAA TGCTTTATTT GTGAAATTTG 4400' 

TGATGCTATT GCTTTATTTG TAACCATTAT AAGCTGCAAT AAACAAGTTA 44 50*. 

ACAACAACAA TTGCATTCAT TTTATGTTTC AGGTTCAGGG GGAGGTGTGG 4500 

GAGGTTTTTT CGGATCCTCT AGAGTCGACC TGCAGGCATG CAAGCTGTAG 4 550 

ATAAGTAGCA TGGCGGGTTA ATCATTAACT ACAAGGAACC CCTAGTGATG 4 600 

GAGTTGGCCA CTCCCTCTCT GCGCGCTCGC TCGCTCACTG AGGCCGGGCG 4 650 

ACCAAAGGTC GCCCGACGCC CGGGCTTTGC CCGGGCGGCC TCAGTGAGCG 4 7 00 

AGCGAGCGCG CAGCTGGCGT AATAGCGAAG AGGCCCGCAC CGATCGCCCT 4 750 

TCCCAACAGT TGCGCAGCCT GAATGGCGAA TGGAANTTCC AGACGATTGA 4800 

GCGTCAAAAT GTAGGTATTT CCATGAGCGT TTTTCCTGTT GCAATGGCTG 4850 

GCGGTAATAT TGTTCTGGAT ATTACCAGCA AGGCCGATAG TTTGAGTTCT 4900 
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TCTACTCAGG CAAGTGATGT TATTACTAAT CAAAGAAGTA TTGCGACAAC 4950 

GGTTAATTTG CGTGATGGAC AGACTCTTTT ACTCGGTGGC CTCACTGATT 5000 

ATAAAAACAC TTCTCAGGAT TCTGGCGTAC CGTTCCTGTC TAAAATCCCT 5050 

TTAATCGGCC TCCTGTTTAG CTCCCGCTCT GATTCTAACG AGGAAAGCAC 5100 

GTTATACGTG CTCGTCAAAG CAACCATAGT ACGCGCCCTG TAG C GGCG C A 5150 

TTAAGCGCGG CGGGTGTGGT GGTTACGCGC AGCGTGACCG CTACACTTGC 5200 

CAGCGCCCTA GCGCCCGCTC CTTTCGCTTT CTTCCCTTCC TTTCTCGCCA 5250 

CGTTCGCCGG CTTTCCCCGT CAAGCTCTAA ATCGGGGGCT CCCTTTAGGG 53 00 

TTCCGATTTA GTGCTTTACG GCACCTCGAC CCCAAAAAAC TTGATTAGGG 53 50 

TGATGGTTCA CGTAGTGGGC CATCGCCCTG ATAGACGGTT TTTCGCCCTT 54 00 

TGACGTTGGA GTCCACGTTC TTTAATAGTG GACTCTTGTT CCAAACTGGA 5450 

AC AAC ACT C A ACCCTATCTC GGTCTATTCT TTTGATTTAT AAGGGATTTT 5500 

GCCGATTTCG GCCTATTGGT TAAAAAATGA GCTGATTTAA CAAAAATTTA 5550 

ACGCGAATTT TAACAAAATA TTAACGTTTA CAATTTAAAT ATTTGCTTAT 5600 

ACAATCTTCC TGTTTTTGGG GCTTTT CTG A TTATCAACCG GGGTACATAT 5650 

GATTGACATG CTAGTTTTAC GATTACCGTT CATCGATTCT CTTGTTTGCT 5700 

CCAGACTCTC AGGCAATGAC CTGATAGCCT TTGTAGAGAC CTCTCAAAAA 5750 

TAGCTACCCT CTCCGGCATG AATTTATCAG CTAGAACGGT TGAATATCAT 5800 

ATTGATGGTG ATTTGACTGT CTCCGGCCTT TCTCACCCGT TTGAATCTTT 5850 

ACCTACACAT TACTCAGGCA TTGCATTTAA AATATATGAG GGTTCTAAAA 59 00 

ATTTTTATCC TTGCGTTGAA ATAAAGGCTT CTCCCGCAAA AGTATTACAG 5950 

GGTCATAATG TTTTTGGTAC AACCGATTTA GCTTTATGCT CTGAGGCTTT 6000 

ATTGCTTAAT TTTGCTAATT CTTTGCCTTG CCTGTATGAT TTATTGGATG 6050 

TTGGAANTTC CTGATGCGGT ATTTTCTCCT TACGCATCTG TGCGGTATTT 6100 

CACACCGCAT ATGGTGCACT CTCAGTACAA TCTGCTCTGA TGCCGCATAG 6150 

TTAAGCCAGC CCCGACACCC. GCCAACACCC GCTGACGCGC CCTGACGGGC 6200 
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TTGTCTGCTC CCGGCATCCG CTTACAGACA AG CTGTG AC C GTCTCCGGGA 6250 

GCTGCATGTG TCAGAGGTTT TCACCGTCAT CACCGAAACG CGCGAGACGA 63 00 

AAGGGCCTCG TGATACGCCT ATTTTTATAG GTTAATGTCA TGATAATAAT 6350 

GGTTTCTTAG ACGTCAGGTG GCACTTTTCG GGGAAATGTG CGCGGAACCC 6400 

CTATTTGTTT ATTTTTCTAA ATACATTCAA ATATGTATCC GCTCATGAGA 6450 

CAATAACCCT GATAAATGCT TCAATAATAT TGAAAAAGGA AGAGTATGAG 6500 

TATTCAACAT TTCCGTGTCG CCCTTATTCC CTTTTTTGCG GCATTTTGCC 6550 

TTCCTGTTTT TGCTCACCCA GAAACGCTGG TGAAAGTAAA AGATGCTGAA 6600 

GATCAGTTGG GTGCACGAGT GGGTTACATC GAACTGGATC TCAACAGCGG 6650 

TAAGATCCTT GAGAGTTTTC GCCCCGAAGA ACGTTTTCCA ATGATGAGCA 6700 

CTTTTAAAGT TCTGCTATGT GGCGCGGTAT TATCCCGTAT TGACGCCGGG 6750 

CAAGAGCAAC TCGGTCGCCG CATACACTAT TCTCAGAATG ACTTGGTTGA 6800 

GTACTCACCA GTCACAGAAA AGCATCTTAC GGATGGCATG ACAGTAAGAG 6850 

AATTATGCAG TGCTGCCATA ACCATGAGTG ATAACACTGC GGCCAACTTA 6900 

CTTCTGACAA CGATCGGAGG ACCGAAGGAG CTAACCGCTT TTTTGCACAA 6950 

CATGGGGGAT CATGTAACTC GCCTTGATCG TTGGGAACCG GAGCTGAATG 7000 

AAGCCATACC AAACGACGAG CGTGACACCA CGATGCCTGT AGCAATGGCA 7050 

ACAACGTTGC GCAAACTATT AACTGGCGAA CTACTTACTC TAGCTTCCCG 7100 

GCAACAATTA ATAGACTGGA TGGAGGCGGA TAAAGTTGCA GGACCACTTC 7150 

TGCGCTCGGC CCTTCCGGCT GGCTGGTTTA TTGCTGATAA ATCTGGAGCC 7200 

GGTGAGCGTG GGTCTCGCGG TATCATTGCA GCACTGGGGC CAGATGGTAA 7250 

GCCCTCCCGT ATCGTAGTTA TCTACACGAC GGGGAGTCAG GCAACTATGG 73 00 

ATGAACGAAA TAGACAGATC G CTG AG AT AG GTGCCTCACT GATTAAGCAT 7350 

TGGTAACTGT C AG AC C AAGT TTACTCATAT ATACTTTAGA TTGATTTAAA 7400 

ACTTCATTTT TAATTTAAAA GGATCTAGGT GAAGATCCTT TTTGATAATC 7450 

TCATGACCAA AATCCCTTAA CGTGAGTTTT CGTTCCACTG AGCGTCAGAC 7500 
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CCCGTAGAAA AGATCAAAGG ATCTTCTTGA 
AATCTGCTGC TTGCAAACAA AAAAACCACC 
TGCCGGATGA AGAGCTACCA ACTCTTTTTC 
AGAGCGCAGA TACCAAATAC TGTCCTTCTA 
CCACTTCAAG AACTCTGTAG CACCGCCTAC 
TGTTACCAGT GGCTGCTGCC AGTGGCGATA 
GACTCAAGAC GATAGTTACC GGATAAGGCG 
GGGTTCGTGC ACACAGCCCA GCTTGGAGCG 
GATACCTACA GCGTGAGCTA TGAGAAAGCG 
AAGGCGGACA GGTATCCGGT AAGCGGCAGG 
GAGGGAGCTT GCAGGGGGAA ACGCCTGGTA 
TTCGCCACCT CTGACTTGAG CGTCGATTTT 
CGGAGCCTAT GGAAAAACGC CAGCAACGCG 
CTTTTGCTGG C CTTTTGCT C ACATGTTCTT 
CTGTGGATAA CCGTATTACC GCCTTTGAGT 
AGCCGAACGA CCGAGCGCAG CGAGTCAGTG 



GATCCTTTTT TTCTGCGCGT 7550 

GCTACCAGCG GTGGTTTGTT 7600 

CGAAGGTAAC TGGCTTCAGC 7650 

GTGTAGCCGT AGTTAGGCCA 7700 

ATACCTCGCT CTGCTAATCC 7750 

AGTCGTGTCT TACCGGGTTG 7800 

CAGCGGTCGG GCTGAACGGG 7850 

AACGACCTAC ACCGAACTGA 7900 

CCACGCTTCC CGAAGGGAGA 7950 

GTCGGAACAG GAGAGCGCAC 8000 

TCTTTATAGT CCTGTCGGGT 8050 

TGTGATGCTC GTCAGGGGGG 8100 

GCCTTTTTAC GGTTCCTGGC 8150 

TCCTGCGTTA TCCCCTGATT 8200 

GAGCTGATAC CGCTCGCCGC 8250 

AGCGAGGAAG CGGAAGAGC 8299 
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WHAT IS CLAIMED IS: 

1. A method for enhancing the efficiency of 
transduction of a recombinant AAV into a target cell ex 
vivo comprising the steps of providing a recombinant 
adeno-associated virus comprising: (a) the DNA of at 
least a portion of the genome of an adeno-associated 
virus which portion is capable of transducing a selected 
gene into a target cell in the absence of cell division; 
and (b) a selected gene operatively linked to regulatory 
sequences directing its expression, said gene flanked by 
the DNA of (a) and capable of expression in the target 
cell; infecting said target cells with said recombinant 
adeno-associated virus; contacting said infected cells 
with an agent which facilitates the conversion of said ss 
recombinant virus to its double stranded form, wherein 
said conversion occurs in said target cell, resulting in 
enhanced transduction of said recombinant virus into said 
target cell. 

2. The method according to claim 1 wherein said 
agent is a helper virus comprising a selected gene 
encoding a polypeptide which can enhance the conversion 
of said single-stranded recombinant virus to double- 
stranded recombinant virus or functional fragment 
thereof, said helper virus capable of transducing said 
selected gene into the target cell in the absence of cell 
division. 

3 . The method according to claim 2 wherein said 
contacting step comprises co-infecting said target cell 
with said helper virus. 

4 • The method according to claim 3 wherein said 
helper virus is an adenovirus. 
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5. The method according to claim 2 wherein said 
selected gene is the adenovirus E4 gene or a functional 
fragment thereof • 

6. The method according to claim 2 wherein said 
adenovirus contains two selected genes or functional 
fragments thereof, said genes being the adenovirus E4 
gene and El gene . 

7. The method according to claim 5 or 6 wherein 
said functional fragment of said E4 gene comprises the 
open reading frame 6 of E4 . 

8 . The method according to claim 1 wherein said 
recombinant adeno-associated virus further comprises a 
second selected gene operatively linked to regulatory 
sequences capable of directing expression of said second 
gene, said second gene capable of facilitating the 
conversion of said single stranded recombinant virus to 
its double stranded form upon expression, and capable of 
co-expression with said first selected gene in the target 
cell, whereby the second gene product is expressed in the 
target cell, wherein said conversion occurs in said 
target cell, resulting in enhanced transduction of said 
recombinant virus in said target cell. 

9. The method according to claim 8 wherein the 
regulatory sequences directing expression of said second 
gene comprise an inducible promoter, and wherein 
expression of said second gene occurs in the presence of 
an inducing agent. 
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10. A recombinant adeno-associated virus comprising 
(a) the DNA of at least a portion of the genome of an 
adeno-associated virus which portion is capable of 
transducing at least two selected genes or functional 
fragments thereof into a target cell in the absence of 
cell division; (b) a first selected gene operatively 
linked to regulatory sequences directing its expression, 
(c) a second selected gene operatively linked to 
regulatory sequences capable of directing expression of 

said second gene, said second gene capable of 

facilitating the conversion of said single stranded 
recombinant virus to its double stranded form upon 
expression, said first and said second genes flanked by 
the DNA of (a) , said first and second gene capable of co- 
expression in the target cell. 

11. The recombinant virus according to claim 10 
wherein said second gene is selected from the group 
consisting of an adenovirus E4 gene, ORF6 of E4 and a ^ 
functional fragment thereof. 

12 . The recombinant virus according to claim 10 -r 
wherein said second gene is the adenovirus El gene or a 
functional fragment thereof. 

13 . The recombinant virus according to claim 10 
wherein the regulatory sequences directing expression of 
said second gene comprise an inducible promoter, and 
wherein expression of said second gene occurs in the 
presence of an inducing agent. 
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14 . The recombinant virus according to claim 10 
further comprising (d) an additional selected gene 
operatively linked to regulatory sequences capable of 
directing expression of said gene, said additional gene 
and said second gene capable of jointly facilitating the 
conversion of said single stranded recombinant virus to 
its double stranded form upon expression of both said 
second and additional genes, said first, second and 
additional genes flanked by the DNA of (a) , and capable 
of co-expression in the target cell. 

15. The recombinant virus according to claim 14 
wherein said second gene is the adenovirus E4 gene or a 
functional fragment thereof and said additional gene is 
the adenovirus El gene or a functional fragment thereof. 

16. The recombinant virus according to claim 15 
wherein said functional fragment of E4 is the ORF6 
sequence. 

17 . The recombinant virus according to claim 15 
wherein the regulatory sequences directing expression of 
said additional gene comprise an inducible promoter, and 
wherein expression of said additional gene occurs in the 
presence of an inducing agent. 

18. A method for enhancing the efficiency of 
transduction of a recombinant AAV into a target cell 
comprising the steps of: providing a recombinant adeno- 
associated virus of claim 10 through 16; infecting said 
target cells with said recombinant virus; and culturing 
said infected cells under conditions which enable 
expression of said selected genes in said target cell, 
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wherein said conversion occurs in said target cell, 
resulting in enhanced transduction of said recombinant 
virus in said target cell. 

19. A method for enhancing the efficiency of 
transduction of a recombinant AAV into a target cell 
comprising the steps of : providing a recombinant adeno- 
associated virus of claim 17; infecting said target cells 
with said recombinant virus; and contacting said target 
cell with an inducing agent which induces the expression 
of said second or additional gene in said target cell, 
wherein said conversion occurs in said target cell, 
resulting in enhanced transduction of said recombinant - 
virus in said target cell. 

20. A pharmaceutical composition comprising a 
recombinant adeno-associated virus and an agent which 
facilitates the conversion of said ss recombinant virus- 
to its double stranded form in a target cell. 

21. The composition according to claim 20 wherein 
said virus is selected from the group consisting of the 
recombinant viruses of claims 10-16. 

22. The composition according to claim 20 wherein 
said virus is the recombinant virus of claim 17 and 
further comprising an inducing agent which induces the 
expression of said second or additional gene in said 
recombinant virus. 

23. A mammalian cell transduced with the 
recombinant adeno-associated viruses of claims 10-17. 



WO 96/39530 



1 /11 



PCT/US96/10245 




SV40 A+ 

FIG. I 



SUBSTITUTE SHEET (RULE 26) 



WO 96/39530 PCT/US96/10245 

2/11 




SUBSTITUTE SHEET (RULE 26) 



PCT/US96/10245 



3/11 




SUBSTITUTE SHEET (RULE 26) 



WO 96/39530 



PCT/US96/10245 



4/11 



1.0E+09 




0.0E+00 V 



r 

100 120 140 
CONC. OF DEX (pM) 
YIELD AT 24 HR P.I. (LFU/ml) 

ORF6 PROTEIN DETECTED AT 24HR P.l. IN 27-18 CELLS (Abs.mm) 



~r 
180 



FIG. 4A 



5.0E+08 
!l 4.0E+08 - 



0.0E+00 




75 100 125 150 
CONC. OF ZINC 0*M) 



175 



200 



YIELD AT 24 HR P.I. (LFU/ml) 

0RF6 PROTEIN DETECTED IN 10-3 CELLS AT 24 HR P.I.(Abs.mm) 

FIG. 4B 

SUBSTITUTE SHEET (RULE 26) 



15 



10 



- 5 



225 



E 
E 
in 

3 



o 
or 



o 



WO 96/39530 PCT/US96/10245 

5/11 




FOLD- 












225 






INDUCTION 


1.0 


107 


15 


1.4 


134 


1.6 


10 



FIG. 5A 

800 -i 




■ dl802 
0 Ad5 

FIG. 5B 



SUBSTITUTE SHEET (RULE 26) 



PCT/US96/10245 

6/11 




MOI 

FIG. 6A 




o •>— csiro^*- m cor- coo>o — 



MOI 

FIG. 6B 



SUBSTITUTE SHEET (RULE 26) 



WO 96/39530 ^ PCT/US96/10245 

7/11 



FIG. 7A 




FIG. 7B -*=!> 



✓-Notl(1035) /-Notl(4509) 

■ « g! 



FIG. 7C 



FIG. 7E 



±3. 



Notl(4509) 



j-Notl(4509) 

izsczz 



Notl DIGEST 



r Notl(4509) 

FIG. 7D ' fi= 



:l(4509) 

Notl(4509) 



Notl DIGEST 



Notl(4509) -v 

FIG. 7F SI 



ITR CMV LocZ pA ITR. 

SD/SA *361~bp 



SUBSTITUTE SHEET (RULE 26) 



WO 96/39530 



PCT7US96/10245 



FIG. 8A 



FIG. 8B 



FIG. 8C 



8/11 

1250 



m m zinc 




25000 
20000 
3 15000 

Q- 
O 

10000 
5000 



HM ZINC 
FOLD- 
INDUCTION 




0 


50 


100 


150 


200 


250 


1 


1.9 


9.4 


22.6 


25.2 


17.1 




SUBSTITUTE SHEET (RULE 26) 



WO 96/39530 



9/11 



PCT/US96/10245 



8O-1 



60- 



3 

o 40 H 
uT 
o 

UJ 
CL 
CO 

20- 



HeLo 



M M ZINC 



o 

Q => 
Pi 





0 


50 


100 


150 


200 


250 


1 


3 


3.5 


5.7 


20.7 


38.3 


26.2 




-1 


1.2 


1.9 


6.9 


12.8 


8.7 



FIG. 9 



SUBSTITUTE SHEET (RULE 26) 



* • 



WO 96/39530 ^ PCT/US96/1024S 

10/ 11 



SnaBI(217) 



Clal(5893) 




SnaBI(523) 
' Sacrf749} / Xb °'(843) 

Xhol(859) 
BamHI(927) 

Notl(1037) 
Smal(1049) 

Pvull(1265) 



Clal(1998) 



Pvull(4923) 

SnaBi(4756) 
Sall(4731} 
Xbal(4725) y 
BamHI(4719) 



Pvull(3822) 
Pvul!(4185) 



Notl(4511) 
BamHI(4521) 



Sacl(3111) 



FIG. IO 



SUBSTITUTE SHEET (RULE 26) 



WO 96/39530 



# 



* 



PCT/US96/10245 



11 /11 



EcoRI(229) 



Clal(5682) 




Xbal(4518) 



FIG. II 



SUBSTITUTE SHEET (RULE 26) 



D INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 



# 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 : 

C12N 15/86, 15/87, 5/10, A61K 35/76, 
48/00 



A3 



(11) International Publication Number: WO 96/39530 

(43) International Publication Date: 12 December 1996 (12.12.96) 



(21) International Application Number: PCT/US96/ 10245 

(22) International Filing Date: 4 June 1996 (04.06.96) 



(30) Priority Data: 

08/462,014 
08/549,489 



5 June 1995 (05.06.95) US 
27 October 1995 (27.10.95) US 



(60) Parent Applications or Grants 

(63) Related by Continuation 
US 

Filed on 

US 

_ Filed on _ 



08/462,014 (CIP) 
5 June 1995 (05.06.95) 
08/549,489 (CIP) 
27 October 1995 (27.10.95) 



(71) Applicant (for all designated States except US): THE 

TRUSTEES OF THE UNIVERSITY OF PENNSYLVA- 
NIA [US/US]; 133 South 36th Street, Philadelphia. PA 
19104-3246 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): WILSON, James, M. 

[US/US]; 1350 N. Avignon Drive, Gladwyne, PA 19035 
(US). FISHER, Krishna, J. [US/USj; 4006 Pine Street, 
Philadelphia, PA 19104 (US). GAO, Guang-Ping [CN/US]; 



Robindale Apartment F5, 1925 Lawrence Road, Havertown, 
PA 19083 (US). 

(74) Agents: BAK, Mary, E. et at.; Howson and Howson, Spring 
House Corporate Center, P.O. Box 457, Spring House, PA 
19477 (US). 



(81) Designated States: AL, AM, AU, AZ, BB, BG, BR. BY, CA, 
CN, CZ, EE, FI, GE, HU. IL, IS, JP, KE, KG. KP. KR, KZ. 
LK, LR, LS. LT, LV. MD, MG, MK, MN. MW, MX. NO. 
NZ. PL, RO, RU, SD, SG, SI, SK, TJ, TM. TR, TT. UA. 
UG, US, UZ, VN. ARIPO patent (KE, LS, MW, SD. SZ, 
UG), Eurasian patent (AM, AZ, BY, KG. KZ. MD, RU, TJ, 
TM), European patent (AT. BE, CH. DE; DK. ES, FI, FR, 
GB. GR, IE. IT, LU, MC, NL, PT, SE), OAPl patent (BF, 
BJ CF CO, CI. CM, GA, GN, ML. MR, NE, SN, TD, TO). 



Published 

With international search report 

(88) Date of publication of the international search report: 

v 5 June 1997 (05.06.97) 



(54) Title: RECOMBIANT ADENOVIRUS AND ADENO- ASSOCIATED VIRUS. CELL LINES, AND METHODS OF PRODUCTION 
AND USE THEREOF 



(57) Abstract 

An adenovirus E1/E4 expressing packaging cell line is provided, which permits the generation of recombinant adenoviruses deleted 
in both gene regions. A method for enhancing the efficiency of transduction of a recombinant AAV into a target cell is provided by 
infecting a target cell with a recombinant AAV comprising a selected transgene under the control of regulatory sequences. The infected 
cell is contacted with an agent which facilitates the conversion of single stranded recombinant virus to its double stranded form. 



• ♦ 



FOR THE PURPOSES OF INFORMATION ONLY 

Codes used to identify States party to the PCT on the front pages of pamphlets publishing international 
applications under the PCT. 



AM 


Armenia 


GB 


United Kingdom 


MW 


MaJawi 


AT 


Austria 


GE 


Georgia 


MX 


Mexico 


AL- 


Australia 


GN 


Guinea 


- NE 


Niger 


BS 


Barbados 


GR 


Greece 


NL 


Netherlands 


BE 


Belgium 


HU 


Hungary 


NO 


Norway 


BF 


Burkina Faso 


IE 


Ireland 


NZ 


New Zealand 


BG 


Bulgaria 


IT 


Italy 


PL 


Poland 


BJ 


Benin 


JP 


Japan 


FT 


Portugal 


BR 


Brazil 


KE 


Kenya 


RO 


Romania 


BY 


Belarus 


KG 


Kyrgystan 


RU 


Russian Federation 


CA 


Canada 


KP 


Democratic People's Republic 


SD 


Sudan 


CF 


Central African Republic 




of Korea 


SE 


Sweden 


CG 


Congo 


KR 


Republic of Korea 


SG 


Singapore 


CH 


Switzerland 


KZ 


Kazakhstan 


SI 


Slovenia 


CI 


Cote d*T voire 


LI 


Liechtenstein 


SK 


Slovakia 


CM 


Cameroon 


LK 


Sri Lanka 


SN 


Senegal 


CN 


China 


LR 


Liberia 


sz 


Swaziland 


CS 


Czechoslovak i a 


LT 


Lithuania 


TD 


Chad 


CZ 


Czech Republic 


LU 


Luxembourg 


TG 


Togo 


DE 


Germany 


LV 


Latvia 


TJ 


Tajikistan 


DK 


Denmark 


MC 


Monaco 


TT 


Trinidad and Tobago 


EE 


Estonia 


MD 


Republic of Moldova 


UA 


Ukraine 


ES 


Spain 


MG v 


Madagascar 


UG 


Uganda 


FI 


Finland 


ML 


Mali 


US 


United States of America 


FR 


France 


MN 


Mongolia 


uz 


Uzbekistan 


GA 


Gabon 


MR 


Mauritania 


VN 


Vict Nam 



INTERNA 



AL, SEARCH REPORT 




Inte ^■^FApplicaOon No 

PCT/US 96/10245 



A. CLASSIFICATION OK SUBJECT MATTER _ „ 

IPC 6 C12N15/86 C12N15/87 C12N5/10 A61K35/76 A61K48/00 



According to intemaoonal Patent Oassihcaoon (IPC) or to both national cla-valicaQon and IPC 



8: FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 

IPC 6 C12N A61K 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the intcmaoonai search (name of data base and, where practical, search terms used) 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category * | Citation oi document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



p.x 



P,Y 
P.X 



P.Y 



J. VIROLOGY, 

vol. 70, no. 1, January 1996, 

pages 520-532, XP0O057O468 

FISHER, K.J. ETAL.: "Transduction with 

recombinant Adeno-Associated Virus for 

gene therapy is limited by leading-strand 

synthesis" 

see the whole document 
J. VIROLOGY, 

vol. 70, no. 3, March 1996, 

pages 1845-1854, XP002027273 

WEITZMAN, M.D. ET AL. : "Recruitment of 

wild-type and recombinant Adeno-Associated 

Virus into Adenovirus replication centers" 

see the whole document 



-/■ 



1-7 



8-23 
1-7 



8-23 



Further documents are listed in the continuation of box C. 



Patent family members are listed in annex. 



* Special categories of a ted documents : 

'A' document defining the general state of the an which is not 
considered to be of particular relevance 

*E" earlier document but published on or after the international 
filing date 

*L" document which may throw doubts on priority cJaimf s) or 
which is a ted to establish the publication date o( another 
citation or other special reason (as specified) 

"O" document referring to an oral disclosure, use, exhibition or 
other means 

*P* document published prior to the intcmaoonai filing date but 
later than the priority dale claimed 



"T* later document published after the international filing date 
or pnonty date and not in conflict with the application but 
cited to understand the principle or theory under! ying the 
in vena on 

*X* document of particular relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an tnvenove step when the document is taken alone 

* Y " document of particular relevance; the claimed invention 

cannot be considered to involve an tnvenove step when the 
document is combined with one or more other such docu- 
ments, such combinaoon being obvious to a person skilled 
in the art. 

"A" document member of the same patent family 



Date o( the actual compleoon of the intcrnaaonal search 



10 March 1997 



Date of mailing oi the international search report 



25. 03. 97 



Name and mailing address of the ISA 

European Patent Office. P.B. 5818 PatentJaan 2 
NL - 2280 HV R M swi,k 

Tel. ( ? 31-^0) 34U-204U. Tx. 31 651 epo nJ. 
l-ajc ( - 3l-?0> 340*3016 



Authorized officer 



Donath, C 



Form PCT ISA 210 (tccorta ihee*) (Jwly |992l 



page 1 of 2 



INT ATIONAL SEARCH REPORT 



* 



C.( Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT 



Inu. onal Application No 

PCT/US 96/10245 



Citation ot document, with mcUcaaon, where appropnate. of the relevant parage? 



Relevant to claim No. 



P.X 



WO 95 20671 A { RH0NE-P0ULENC RORER S.A ) 3 
August 1995 ' 
see page 2, line 33 - page 3, line 30 
see page 5, line 28 - page 7, line 24- 
examples 2-5 

WO 96 13598 A (THE TRUSTEES OF THE 
UNIVERSITY OF PENNSYLVANIA) 9 May 1996 
see page 4, line 23 - page 5, line 25 
see page 8, line 1 - page 15, line 13 
see page 18, line 28 - page 19, line 13 

WO 96 22378 A (RH0NE-P0ULENC RORER S.A ) 

25 July 1996 ' 

see page 3, 1 ine 3 - 1 ine 21 

see page 7, line 17 - line 32 

see page 13, line 27 - page 16, line 26 

GENE, 

vol. 119, no. 2, 1992, 
pages 265-272, XP002027274 
NAHREINI, P. ET AL. : "Cloning and 
integration of DNA fragments in human 
cells via the inverted terminal repeats of 
the adeno-associated virus 2 genome" 
see the whole document 

CRC HANDBOOK OF PARVOVIRUSES, 

ED.P.TIJSSER, 

vol. I, 1990, 

pages 155-168, XP002027275 

CARTER, B.J.: "The growth cycle of 

adeno-associated virus" 

see page 158, last paragraph - page 161 

paragraph 1 



1-7 



1-7 



1-7 



1-7 



8-23 



PCT ISA J|Q «nflunu»uor> ot iccond ihect) (July 1992) 



page 2 of 2 



INTERNAT^^AL SEARCH REPORT 

Information on patent iarruly members 




tnu ^^■F ApplicAaon So 

PCT/US 96/10245 



Patent document 
cued in search report 



Publication 
date 



Patent family 
member! s) 



Publication 
date 



wo 


9520671 


A 


03-08-95 


FR 


2716682 


A 


01-09-95 










All 




A 

A 


id- uo - y D 










CA 


2181602 


A 


03-08-95 










EP 


0741793 


A 


13-11-96 










PI 


JUL 


A 












NO 


962950 


A 


12-07-96 










ZA 


9500628 


A 


23-10-95 


wo 


9613598 


A 


09-05-96 


All 


4405596 


A 


23-05-96 


wo 


9622378 


A 


25-07-96 


FR 


2729674 


A 


26-07-96 










AU 


4544396 


A 


07-08-96 



Form PCT ISA 310 j patent family wnti) Uuly 1992} 



• 



* 



COnnECTED 
VERSION* 



LD INTELLECTUAL PROPERTY ORGANIZATIO 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 : 

C12N 15/86, 15/87, 5/10, A61K 35/76, 
48/00 



A3 



(11) International Publication Number: WO 96/39530 

(43) International Publication Date: 12 December 1996 (12.12.96) 



(21) International Application Number: PCT/US967 10245 

(22) International Filing Date: 4 June 1996 (04.06.96) 



(30) Priority Data: 

08/462 iO 1 4 
08/549,489 



5 June 1995 (05.06.95) US 
27 October 1995 (27.10,95) US 



(60) Parent Applications or Grants 

(63) Related by Continuation 
US 

Filed on 
US 

Filed on 



08/462,014 (OP) 
5 June 1995 (05.06.95) 
08/549.489 (CIP) 
27 October 1995 (27.10.95) 



(71) Applicant (for all designated States except US): THE 

TRUSTEES OF THE UNIVERSITY OF PENNSYLVA- 
NIA [US/US]; 133 South 36th Street, Philadelphia, PA 
19104-3246 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): WILSON, James, M. 
[US/US]; 1350 N. Avignon Drive, Gladwyne, PA 19035 
(US). FISHER, Krishna, J. [US/US]; 4006 Pine Street, 
Philadelphia, PA 19104 (US). GAO, Guang-Ping [CN/USJ; 



Robindale Apartment F5, 1925 Lawrence Road, Havertown, 
PA 19083 (US). 

(74) Agents: BAK, Mary, E. et al.; Howson and Howson, Spring 
House Corporate Center, P.O. Box 457, Spring House, PA 
19477 (US). 



(81) Designated States: AL, AM, AU, A2, BB, BG, BR, BY, CA, 
CN, CZ, EE, FI, GE, HU, IL, IS, JP, KE, KG, KP, KR, KZ, 
LK, LR, LS. LT, LV, MD, MG, MK, MN, MW, MX, NO, 
NZ, PL, RO, RU, SD, SG. SI, SK, TJ. TM, TR, TT. UA, 
UG, US. UZ, VN, ARIPO patent (KE, LS, MW, SD, SZ. 
UG), Eurasian patent (AM, AZ, BY, KG, KZ. MD, RU, TJ, 
TM), European patent (AT, BE, CH, DE, DK, ES, FI, FR, 
GB, GR, IE, IT. LU, MC, NL, PT, SE), OAPI patent (BF, 
BJ. CF, CG, CI, CM, G A, GN, ML, MR, NE, SN, TD, TG). 



Published 

With international search report, 

(88) Date of publication of the international search report: 

5 June 1997 (05.06.97) 



(54) Title: RECOMBINANT ADENOVIRUS AND ADENO- ASSOCIATED VIRUS, CELL LINES. AND METHODS OF 
PRODUCTION AND USE THEREOF 



(57) Abstract 

An adenovirus E1/E4 expressing packaging cell line is provided, which permits the generation of recombinant adenoviruses deleted 
in both gene regions. A method for enhancing the efficiency of transduction of a recombinant AAV into a target ceil is provided by 
infecting a target ceil with a recombinant AAV comprising a selected transgene under the control of regulatory sequences. The infected 
cell is contacted with an agent which facilitates the conversion of single stranded recombinant virus to its double stranded form. 



* (Referred to in PCT Gazette No. 32/1997, Section II) 



* 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international 
applications under the PCT. 



AM 


Armenia 


GB 


United Kingdom 


MW 


Malawi 


AT 


Austria 


GE 


Georgia 


MX 


Mexico 


AU 


Australia 


GN 


Guinea 


NE 


Niger 


BB 


Barbados 


GR 


Greece 


NL 


Netherlands 


BE 


Belgium 


HU 


Hungary 


NO 


Norway 


BF 


Burkina Paso 


IE 


Ireland 


NZ 


New Zealand 


BG 


Bulgaria 


IT 


Italy 


PL 


Poland 


BJ 


Benin 


JP 


Japan 


PT 


Portugal 


BR 


Brazil 


KE 


Kenya 


RO 


Romania 


BY 


Belarus 


KG 


Kyrgystan 


RU 


Russian Federation 


CA 


Canada 


KP 


Democratic People's Republic 


SD 


Sudan 


CF 


Central African Republic 




of Korea 


SE 


Sweden 


CC 


Congo 


KR 


Republic of Korea 


SG 


Singapore 


CH 


Switzerland 


KZ 


Kazakhstan 


SI 


Slovenia 


CI 


Cote d' I voire 


LI 


Liechtenstein 


SK 


Slovakia 


CM 


Cameroon 


LK 


Sri Lanka 


SN 


Senegal 


CN 


China 


LR 


Liberia 


sz 


Swaziland 


cs 


Czechoslovakia 


IT 


Lithuania 


TD 


Chad 


\cz 


Czech Republic 


LU 


Luxembourg 


TG 


Togo 


D£ 


Germany 


LV 


Latvia 


TJ 


Tajikistan 


DK 


Denmark 


MC 


Monaco 


TT 


Trinidad and Tobago 


EE 


Estonia 


MD 


Republic of Moldova 


UA 


Ukraine 


ES 


Spain 


MG 


Madagascar 


UG 


Uganda 


Fl 


Finland 


ML 


Mali 


US 


United States of America 


FR 


France 


MN 


Mongolia 


uz 


Uzbekistan 


GA 


Gabon 


MR 


Mauritania 


VN 


Viet Nam 



